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PSYCHOLOGICAL TESTS, INTELLIGENCE, AND 
FEEBLEMINDEDNESS* 


JOSEPH JASTAK 
Delaware State Hospital, Farnhurst, Del. 


INTELLIGENCE, GLOBAL OR GENERAL? 


The accurate appraisal of feeblemindedness depends on what is meant by 
intelligence and on the relation of intelligence to the over-all adjustments of people. 
Intelligence is admittedly a part function of the personality. There are other factors, 
both intrinsic and extrinsic, which determine the level and quality of global behav- 
ior. Yet by the very nature of the techniques and concepts now available, intelli- 
gence is treated as if it were all of behavior and as if it explained the totality of 
mental experiences. The global definitions of intelligence by Stoddard © and Wech- 
sler®) are examples of the pars-pro-toto fallacy. The intelligence quotient, as a 
practical but unitary measure, gives support to that fallacy. Psychologists realize 
that intelligence is a part function, but the methods at their disposal compel them 
to look upon it as a totality function. Some conveniently vacillate between the two 
point-of-views. 

To find out who is and who is not feebleminded, we need a system of person- 
ality analysis that is logically and psychologically congruent and that fits reality in 
most, if not in all, contingencies. Our aims are not easily attainable. Even if they 
were attained, the tremendous inertia associated with present practices would mili- 
tate against their acceptance. However, we must under no circumstances be satis- 
fied with the present state of psychological diagnosis. We must admit that our 
knowledge of what psychometric and projective tests measure is meager. We should 
learn from the mistakes of the past and from a critical study of the views which 
tend to perpetuate such mistakes. The object of this paper is to outline some of the 
assumptions which may have to be made in building a new clinical psychology. 
More specifically, I will tell you what I think intelligence is and what it is not, and 
how our clinical procedures may be changed to get in line with the facts. 

To begin with, intelligence is a general factor. It partly determines all human 
acts, abilities, and adjustments at all age levels. The generality of a trait should not 
be confused with totality or globalness. The effect of intelligence on all that humans 
do is partial. The problem before us is to find the exact degree to which it influences 
adaptive behavior. My hunch, based on 20 years of clinical experience and experi- 
menting, is that intelligence, as a general factor, has a relatively small share in 
personality adjustment, perhaps an average of 20 per cent of its total variance. 

Should my hunch be correct, then intelligence cannot be the ability to adjust, 
to learn, to reason, to perceive the truth, and to create new values. These global 
abilities depend on non-intellectual factors to the extent of 80 per cent of the var- 
iance. Intelligence may be a partner in these abilities but it is not the sole determin- 
ant. Instead of misidentifying intelligence with abilities, we may conceive of it as a 
dimensional concept abstracted from all abilities and adjustments. Intelligence is 
no more an ability than gravity is the object which permits its measurement. 

Intelligence, as most scientific constructs, is neutral in relation to the social use 
to which it is put. It is an ideal level toward which we strive irrespective of success 
or failure. If destruction of values is our aim, we will destroy in direct proportion to 
our capacity. If personality survival is achieved by irrationality, then intelligence 
supports irrationality. The nature and complexity of delusions, for example, varies 
to some extent with the degree of intelligence. A young patient, a successful college 
student, was asked to give three reasons why there are windows in our houses. His 
responses were: to put up Venetian blinds; to keep people from peeping in; to pro- 


*This article is a modification of a paper read at the Symposium on Mental Deficiency at the 
September 1950 convention of the American Psychological Association at State College, Pennsylvania. 
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tect our constitutional rights. Each response is an intelligently expressed defense 
against reality. His thinking is a good example of the intelligent failures so common 
in clinical studies. Failure to adjust per se is not and cannot be a sign of lack of 
intelligence. 

Intelligence is a latent capacity and not an observable ability. As such it may 
be abstracted from the highest degrees of success or failure. Both give equivalent 
results in the process of abstraction. On tests, the abstraction is usually made from 
success. In such instances it represents the level of maximum personality integration. 
In actual life, intelligence must frequently be judged from the complexities of failure. 
For example, many juvenile delinquents represent high-level failures to adjust to 
society. If intelligence is estimated from negative and destructive behavior, it is 
likely to stand for the level of maximum personality disorganization. 


INTELLIGENCE AND Group FAcrors 


Intelligence enters into any adjustment or ability. That is why it is a general 
factor. Yet approximately 80 per cent of the variance of any adjustment or ability 
is due to factors other than intelligence. Some of these non-intellectual traits are 
known as group factors. There are also factors specific to each adjustment and 
ability, but they are less important than are the group factors. A group factor differs 
from intelligence in that it yields moderate positive correlations with a cluster or 
group of abilities. There are probably many clustering traits within the personality. 
The abilities of each individual have a unique factorial composition in accordance 
with the differences in personality structure. Personality make-up and ability pat- 
terns are in the final analysis synonymous. 

Reading ability, for example, is determined by at least three factors. One of 
them is general intelligence. The other two are what Thurstone “ would call a verbal 
and memory factor. While Thurstone seems to consider verbalness and memory 
part of intelligence, we believe that all group factors lie outside the realm of intelli- 
gence. They represent character variables which control the major variance of in- 
dividual behavior. The so-called memory factor is in reality a motivational attri- 
bute. Motivation is a highly significant facet of the mind. It has many social ramifi- 
cations, but as a measurable trait it is ethically neutral. It may be briefly defined 
here as goal striving, persistence, ambition, self-control. If a person is well moti- 
vated for a socially positive goal, he may be astonishingly successful despite inferior 
intellect. Conversely, defective motivation in a genius may result in complete failure 
to adjust. 

From this it follows that, even though both factors operate simultaneously in 
the same individual, their relative strengths vary at random to each other. They are 
capable of re-inforcing, or opposing each other, or of striking a compromise. Further- 
more, one trait may serve as a compensatory medium for the other. The degree of 
motivation available to an individual is of course independent of the degree of his 
intelligence. What a person accomplishes in life partly depends on the combination 
of personality traits, general, group, and specific, and on the relationships between 
them. Group factors like motivation may be measured as independent functional 
unities only if the general factor of intelligence is duly accounted for and statistically 
neutralized. 


Tests AND MEASURES 


The most glaring weakness of the intelligence quotient is that it is a simple 
unity. Complex oneness can mathematically be represented by many simple unities 
arranged in realistically valid patterns and held together by one or more unifying 
principles. An intelligence or personality quotient cannot reveal the intra-individual 
complexities and relationships that interest the psychologist. Test profiles are 
equally inadequate media for the charting of personality properties. They are merely 
a more or less random array of whatever we happen to find. A science of personality 
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will not evolve from techniques which happen to be convenient. In our search for 
the truth we must work systematically and organize the uncovered facts in a cogent 
manner compatible with reality. 

Several features of psychological tests are more important than intelligence 
quotients. The ability range is diagnostically significant. It varies in extent because 
different ranges may give identical quotients. It varies in quality because identical 
ranges are characterized by different combinations and permutations of abilities. 
Inter-ability relationships disclose qualities of personality functioning. Intra-test 
seatter has good diagnostic potentialities. Test contents may give clues to attitudes 
and stresses unrevealed in a formal statistical analysis. 

Intelligence may not be identified with any one ability no matter how much we 
think of that ability as being ‘intellectual’. Tests of analogies and similarities, 
whether verbal or spatial, are excellent clinical tools, but they have no special merits 
as tests of intelligence. If so used, the majority of diagnoses based on them will turn 
out to be erroneous. If used on college students, most of the variance of such tests 
is taken up by group factors or personality dynamics. College students are highly 
homogeneous groups in regard to the factor of intelligence. If a trait is neutralized 
by empirical selection, it cannot show positive correlations with tests sensitive to 
its presence. 

Scores on analogies tests may be somewhat related to academic success. Para- 
noid behavior and withdrawal from reality may also be related to academic success. 
Analogies tests may favor unrealistic people. Therefore, the aggressive paranoiac 
may have a better chance of getting into college, if analogies tests are used in the 
selection, than will an equally intelligent student who is less circumspect and better 
adjusted. 

Ikach person expresses his potential capacity through different adjustments and 
abilities. To obtain a valid measure of his capacity, we have to try him out on a 
large number of different abilities or study him in many different situations. A com- 
pletely unbiased measure of intelligence can be had only from an exhaustive study 
of the entire repertory of human abilities. Occasionally, a single test answer may be 
suggestive of a particular problem or some special diagnosis, but psychologists 
should want to know much more about people than their psychiatric diagnoses. 

As a rule, abbreviated test procedures are not good clinical psychology, not 
even in the hands of an experienced clinician. If a human trait has a variable effect 
on each ability in any one person, then no trait can be accurately measured in all 
persons by means of one ability. The dynamics of multiple dimensions in simultan- 
eous action are so kaleidoscopic that psychology will always depend on many tools 
and many reference points for a comprehensive understanding of personality struc- 
ture. 

Reliable measures obtained from a large number of different ‘abilities tend to 
array themselves in patterns typical of individual personalities. In independent 
population samplings, scores obtained from a large number of different abilities 
have an equal chance of occurring at any position of the intra-individual range. 
When test scores are ranked according to size, from highest to lowest ability for each 
person, and when the scores of each rank are correlated with each ability, the cor- 
relations will be highest at the median ranks and gradually decrease toward the 
extremes of the ranks. This indicates that extreme scores are determined by fewer 
factors than are those near the central tendency of the distribution. The highest 
and lowest abilities of a person are psychologically purer than are his intermediate 
abilities. As we proceed from the median ranks toward both ends of the ability 
range, the effect of group factors is minimized and the effect of the general factor is 
maximized. In normally heterogeneous samplings, even the lowest correlations be- 
tween reliably measured abilities will be significantly positive. 

An average of all the correlations between the scores of extreme ranks and all 
abilities may well approach the true theoretical correlation between intelligence 
and any human adjustment. The correlational base of the general factor is evident 
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Fig. 1. Schematic graph showing the size of correlation coefficients when test scores are ranked 
and the scores of each rank correlated with each ability. S stands for specific factors and E for errors 
of measurement. The base of the general factor and the superimposed group factors explain the 
possible meaning of the correlational distribution. 


only at the ends of the distribution as in fig. 1. The higher correlations near the 
central tendency are probably due to the cumulative effects of a number of group 
factors superimposed upon the base of the general factor. The group factors are 


arranged in clusters at different points of the range in different people, thus being 
independent of the general factor and of each other. 


ALTITUDE QUOTIENT 

Intelligence is a capacity. Its correlation with top and bottom abilities, we be- 
lieve, is theoretically the most accurate. Clinically, the highest abilities of a person 
are most closely related to native potentiality as a general factor. These considera- 
tions have led to the adoption of what we call the altitude quotient as a measure of 
capacity in contrast to the centroid measures of the group factors. In clinical usage, 
the altitude quotient is an average of the three highest abilities. It is helpful not 
only as an index of latent capacity, but, along with the basal quotient which is an 
average of the three lowest abilities, a useful reference point from which the non- 
intellectual group factors may be measured. By this means, the actual independ- 
ence of the isolated factors may be empirically demonstrated. 

We have compared altitude quotients and 1Q’s in a number of normal and ab- 
normal populations and found them to be highly correlated. Pearson r’s ranged from 
.79 in a group of 200 nursing students to .93 in a group of 200 male State Hospital 
patients. These correlations are higher than anticipated because an average of three 
abilities was used instead of the theoretically more appropriate procedure of using 
the highest ability. 


MentAL DEFICIENCY 


What inferences can be made concerning the diagnosis of feeblemindedness 
in the light of the preceding discussion? Feeblemindedness, in our view, is a de- 
ficiency of the general but non-global factor of intelligence™. Since the general 
factor partly affects all abilities and adjustments, it cannot be defective unless re- 
tardation is also general. A person is feebleminded only if all his abilities are uni- 
formly reduced to the lowest levels of a random population sampling. 
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Persons who are defective in some abilities and not in others are not feeble- 
minded but suffer from defects in character. This principle is clinically valid regard- 
less of the social valuation placed upon the defective cluster of abilities. If a pre- 
dominantly verbal scale is used, approximately three out of every four defective 
ratings it yields represent defects in personality organization and not in intelligence. 
The same thing is true of a performance scale. The more biased our ability samplings 
and our population samplings, the less we know about any of the measurable factors 
of the mind. 

Thus a clearcut deficiency in verbal abilities by itself must never be considered 
a sign of feeblemindedness. It may be a sign of deaf-mutism, cerebral palsy, epil- 
epsy, psychogenic blocking, foreign-language background, educational neglect, and 
other group factors. A person afflicted with any of these conditions may be of 
superior, average, or defective intelligence. His true capacity may be estimated 
only from a large number of non-verbal ability clusters. 

A valid and reliable altitude quotient cannot be obtained from abbreviated 
techniques. The use of altitude quotients favors the comprehensive examination of 
a person. It also makes sure that no person, regardless of the size of his 1Q, is diag- 
nosed as feebleminded unless the defect extends over all measured abilities. Since 
the test levels near the IQ are influenced mostly by group factors, low 1Q’s are in the 
majority of cases due to clustering defects. Personality disturbances of all types may 
give low or defective 1Q’s in people of relatively high levels of intelligence. 

Feeblemindedness and unadjustability are not the same. Defective intelligence 
merely lowers the level at which the individual succeeds or fails. It does not prevent 
adequate social adjustment within that level. If personality were uni-dimensional 
and intelligence global, all feebleminded persons would automatically be useless to 
society or themselves. Fortunately, personality is multi-dimensional and intelli- 
gence is not global. Since most of the variance of mental functioning is due to a 
variety of character attributes and since these attributes may be normal even in 
persons of defective intelligence, a considerable proportion of the féebleminded are 
socially useful and adjustable people. 

Intellectually deficient persons with simultaneous defects in personality have : 
greater chance of being committed to State Colonies than have intellectually de- 
ficient persons with otherwise normal personalities. Many morons never come to 
the attention of mental clinics because they present no greater difficulties than does 
the average, law-abiding citizen. Intelligent but troublesome persons do come to 
the attention of clinies and colonies. Severe and chronic personality handicaps widen 
their ability range, increase the unreliability of their actions, and cause their 1Q’s 
to be defective, even though the patients are bright. Their brightness happens to 
be used mainly for irritating maladaptive practices which are erroneously identified 
with lack of intelligence. The level and complexity of their failures may be objective- 
ly estimated from the altitude quotient. 


DracNostic Suirrs Dun tro ALrTiruDE 

To establish the effeet which a change from the 1Q to the AQ has on the diag- 
nosis of intelligence, we have surveyed 1596 active clinic records. Of this number, 
381 cases or about 24 per cent had 1Q’s below 70 on one of the following scales: 
Stanford-Binet, Wechsler-Bellevue, Cattell Infant Scale, or Wechsler Children 
Scale. The ages of these patients ranged from 4 months to 60 years. 

On the basis of the altitude quotients, only 123 patients or about 32.5 per cent 
of the group were diagnosed feebleminded. The remaining 258 patients had diag- 
noses ranging from inferior to superior intelligence. The highest altitude. quotient 
was 128 obtained by a patient whose Stanford-Binet quotient was 67 and who 
suffered from grand-mal epilepsy. The highest IQ which also gave a defective AQ 
was 66. 

Among white patients 36 per cent were diagnosed feebleminded, among colored 
patients only 25 per cent were so diagnosed by means of altitude quotients. A col- 
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ored patient with an IQ below 70 is less likely to be feebleminded than is a white 
patient with a similar 1Q. The proportion of males among those diagnosed as feeble- 
minded was 54 per cent, that of females 46 per cent. 

The psychiatric diagnoses of those who had 1Q’s below 70 but were not feeble- 
minded on the basis of their AQ’s were: primary behavior disorder, personality dis- 
turbance 47; conduct disturbance 83; personality and conduct disturbance 26; 
epilepsy 19; cerebral palsy 14; psychopathic personality 13; neurosis 13; glandular 
dysfunction 12; schizophrenia 11; manic-depressive psychosis 4; congenital language 
disability 4; chronic alcoholism 3; general paresis 3; hydrocephalus, encephalitis, 
avitaminosis, brain injury, deafmutism, simple adult maladjustment—1 each. 

Many of these patients were followed up in the out-patient or in-patient facilities 
of the State of Delaware. Their therapeutic progress usually results in the improve- 
ment of one or more of the centroid personality traits. Prognostic outlook generally 
depends on the number, severity, and chronicity of the nonintellectual defects. 


SUMMARY 


Intelligence is not a global trait but a general and pervasive part function of the 
personality. Feeblemindedness can be differentiated from other personality defects 
by means of the altitude quotient which approximates the level of maximum person- 
ality integration but does not preclude successful adjustment to life. High intelli- 
gence by itself does not insure normal adjustment or high ability. It may eventuate 
in highly intelligent failure. Success and failure are functions of the whole organism. 
This in turn is a composite of many mutually independent traits and influences, in- 
herent and environmental. The differential effects of these traits and influences are 
measurable by traditional psychometric methods. 
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A STUDY OF THE VALIDITY OF SOME HYPOTHESES FOR THE 
QUALITATIVE INTERPRETATION OF THE H-T-P FOR CHILDREN OF 
ELEMENTARY SCHOOL AGE: IL. SEXUAL IDENTIFICATION* 


ISAAC JOLLES 


Illinois State Department of Public Instruction 


INTRODUCTION 

From the very first the writer felt that the H-T-P had possibilities with children. 
Since the test had not been validated or standardized on children, only the PDI 
(post drawing interrogation) was used for interpretation. It was found that a great 
deal could be learned about the feelings and attitudes of children with the drawings 
being used only as a frame of reference for the PDI questions. For example, one 
mentally deficient eight-year-old girl replied to the question, ‘‘ What does that 
person need most?” by saying, ‘A really new dress that nobody else has wore.” 

Subsequently, the drawings were interpreted on a somewhat experimental basis 
with the result that such interpretations often proved very significant in working 
with the children. After about a year of experience with the H-T-P in this manner, 
the test was introduced to other psychologists in Illinois public schools. They used 
the test sparingly at first, more frequently later, and on the whole they seemed to be 
impressed with the results. In spite of the fact that our observations seemed to 
confirm Buck’s“? hypotheses for adult subjects, the writer felt that these hypotheses 
should be tested experimentally and that some basis for a general frame of reference 
should be established. Consequently, this study was undertaken. 


THE PROBLEMS 


The concepts or hypotheses that the writer has planned to investigate are: 
(1) Do children of grade school age tend to draw persons of their own sex? (2) Does 
the tree which is drawn so as to resemble the phallus have sexual significance or is 
this merely characteristic of young children’s drawings and represents only an im- 
mature percept? (3) Do the interpretations applied to placement of the drawings 
on the page, i.e., on the left or right, apply to children of grade school age? Is the 
psychological center, as defined by Buck“?, a valid frame of reference for children? 
(4) What effect does age of the child have upon space constriction, and how does this 
affect our interpretation of space constriction in drawings of children? (5) What is 
“normal perspective” in the house for children of various age levels? Also, there is 
the problem of standardizing the quantitative aspects of the test for this age group. 
The data from this study will be made available to John N. Buck for this purpose. 


Tur SuBJECTS 


The subjects for this study are children in the public schools in Illinois ranging 
in age from 5 through 12, the age being figured according to the nearest birthday. 
Their placement in school ranges from kindergarten through sixth grade. Approx- 
imately 1000 drawings were obtained from Galesburg, 500 from Lisle, 500 from Elm- 
hurst, 1500 from Springfield, and about 5000 from Decatur. Thus, there are approx- 
imately 8500 sets of drawings available for research purposes. In Decatur drawings 
were obtained from the entire school body, i.e., kindergarten through sixth grade, 


*The writer wishes to express his deep appreciation for the splendid way in which Martha Sehiesh- 
er, Superintendent of Schools at Lisle, Maurice Turner, Superintendent of Schools at Elmhurst, R. V. 
Lindsey, Superintendent of Schools at Galesburg, Lester Grant, Superintendent of Schools at De- 
eatur, and J. E. Bohn, Superintendent of Schools at Springfield, and their respective staffs of principals 
and teachers cooperated to make this study possible. Special mention should be made of Miss Char- 
lotte Meyer, Elementary Supervisor of the Decatur Public Schools, for her efforts in obtaining draw- 
ings from over 5,000 children. The writer is also very much indebted to Dr. William Sloan of the 
Lincoln State School and Colony, Lincoln, Illinois, for his advice and help in the statistical treatment 
of the data. 
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except for those who may have been absent on the day the drawings were made. In 
the other areas a good representative sampling was obtained, for the participating 
schools form a good cross section of the various socio-economic groups in their res- 
pective cities. The metropolitan areas of Peoria and Chicago are not represented 
here, but the participating school districts contain almost all occupational groups 
in the state except for the river front workers. 

This study attempts to test the first of the five hypotheses proposed above, 
namely, the hypothesis that children of grade school age tend to draw persons of 
their own sex. All of the drawings were not used in this study but the representative 
sampling was maintained. This was easily done because all of the drawings were 
submitted in groups according to schools, and the writer is very familiar with the 
areas covered by these schools. Some of the drawings had to be discarded because 
instructions had not been followed. For example, some were done in crayon, some 
in heavy pencil, one or two in ink, ete. In this particular study, approximately 2700 
drawings were appraised, but about 100 of these had to be discarded because of the 
fact that the sex of the person could not be identified or because directions had not 
been followed. Also, an additional 38 drawings could not be used in the statistical 
treatment because the persons drawn had ambivalent sexual characteristics. This 
left a total of 2560 subjects. An analysis of an additional 1500 drawings was made 
by another worker on the problem, but it arrived in the office two days after the 
statistical treatment had been completed. Since a precursory review of this data 
seemed to be very much in agreement with that already obtained, it did not seem 
necessary to do the statistical work all over again. 


PROCEDURE FOR OBTAINING THE DRAWINGS 


The drawings were obtained by teachers from the children in their respective 
classrooms, and this was done on a group basis. Essentially, the instructions used 
for the individual examination were followed except for such additional instructions 
as to warn the children against copying from their neighbors. First the house was 
drawn, then the tree, and finally the person. The drawings were done on sheets of 
paper of legal size folded so as to make the equivalent of the standard H-T-P form 
except for the absence of the printed word at the top of each drawing space. 

There were a few violations of standardized procedure. Pencils varied from 
No. 1 to No. 3 lead instead of the standard No. 2 suggested for all drawings. ‘The 
paper on which the drawings were made was not always of the same texture as that 
of the standardized form. These violations could not be helped because there were 
no funds available for this research, but the writer feels that these violations are not 
important as far as this research project is concerned. 

The chief objection which might be raised regarding this procedure concerns the 
possible differences in reactions of children drawing in a classroom situation under 
the supervision of the teacher and possibly under the influence of the other children 
in the class from the reactions of those who draw for a psychologist in a testing situa- 
tion. If this objection is valid, then the results obtained are to be questioned. One 
precaution was taken in this respect. A check was made on the type of trees the 
Springfield teachers had been instructing the children to draw, and then the child- 
ren’s drawings were examined to see if this was an obvious influence upon their re- 
productions. No such obvious influence seemed to be in the picture. 

After the children’s drawings had been completed, the teachers (in some cases 
the children) recorded the children’s ages in years and months and their sex. When 
the information was available, the teachers recorded the M.A. and I.Q. obtained 
on the most recent intelligence test, the C.A. at the time of the test, and the 
name of the test. 


ANALYSIS OF RESULTS 


Tables 1 and 2 give the raw data for males and females respectively. These 
tables show the number and per cent of children in the various age groups drawing 
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human figures of their own and opposite sex as well as ambivalent figures. For 
example, 81.5, of the 5-year-old males drew male figures, 17.5°; drew female per- 
sons and 1°, drew an ambivalent person. The data from tables 1 and 2 were com- 
bined for the purposes of testing the hypothesis that children tend to draw persons 
of their own sex. Chi Square was used for the purpose of testing this hypothesis“. 
Table 3 represents the contingency table for obtaining the Chi Square. In the first 
column we see that 1083 males drew males and that 199 males drew females. Like- 
wise in the second column we see that 257 females drew males and that 1021 females 
drew females. The figures in parentheses represent the theoretical frequencies or 
the frequencies which one would expect from a chance distribution. The obtained 
Chi Square was 10.68 which is significant at the 1°) level of confidence ®. Thus, 
Buck’s hypothesis is verified in this particular group. 

The question which now arises is: Can we make this interpretation for the 
younger children as well as for the older children? Again Chi Square was used for 
analysis of the data, and table 4 is the contingency table. The children were arbi- 
trarily divided into two groups, young and old. The young group included ages 5 
through 8, the old group ages 9 through 12. In Table 4 it can be seen that 939 of 
the young group drew human figures of like sex and 234 drew human figures of the 
opposite sex. Likewise 1165 of the older group drew persons of their own sex while 
£56 drew persons of the opposite sex. The obtained Chi Square is 6.716 which is 
significant at the 1°; level of confidence. By comparing the theoretical frequencies 
with the obtained frequencies in each cell of the contingency table, it can be seen 
that the younger children tend to draw more human figures of unlike sex than the 
older children. 

In view of this interesting finding one may raise still another question: Is there 
a sex difference among the younger group of children? Tables 5 and 6 are the con- 
tingency tables for males and females respectively. The figures in these tables are 
read in the same manner as those in table 4 except that the subjects are confined to 
one sex. The obtained Chi Square for the males is 18.87 which verifies the hypothesis 
established above and is significant at the 1°, level of confidence. However, the 
Chi Square which is obtained for the female group is only .176 which is significant 
between the 50 and 70°, level of confidence or essentially a chance difference. 

Table 2 reveals a tendency for 11 and 12-year-old girls to draw more figures of 
the opposite sex than the younger girls and also the males. The writer hesitates to 
regard this as a definite trend without data from 13 and 14-year-old children. 
Nevertheless, it raises the question whether there is a relationship between this 
phenomenon and the growing custom among adolescent girls to wear masculine 
clothing (blue jeans, slacks, etc.). Research along this line might give us some inter- 
esting clues regarding differences in the psychosexual development of girls and boys. 

The fact that a sex difference is indicated suggests the need for further analysis 
of the data. The difference between 11 and 12-year-old girls and boys was tested 
for significance by the use of Chi Square (Table 7), and the resulting Chi Square of 
51.9198 is significant at less than the 1°, level of confidence. A further test of sex 
difference was then made for the entire group (Table 8). The obtained Chi Square 
of 8.9761 is significant at less than the 1°, level of confidence. Therefore, there is 
evidence that, on the whole, girls tend to draw more human figures of the opposite 
sex than do boys. 

These trends indicate a need for research in the field of psychosexual develop- 
ment, especially in the area pertaining to sexual identification. Certainly it is im- 
portant for the clinician to know why girls tend to identify with male figures to a 
greater extent than boys to identify with female figures (assuming that further 
validation studies confirm the present findings). Also, clinicians should know why 
the opposite trend appears among the young children. In the latter case there seems 
to be a logical hypothesis to use for such a study. It is possible that during infancy 
and even during most of the pre-school years the child has closer relations with the 
mother than with the father. Thus there might be a greater tendency for the young 
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child, male or female, to identify with the mother rather than the father. Later 
there is, perhaps, more and more relationship with the father, thus permitting the 
boys to become more strongly identified with the male sex. Assuming this to be 
true, we have further validation of Buck’s hypothesis that the sex of the drawn 
person may represent the felt sex role of the subject. 

For the present we have to be content with the facts without the reasons for 
them. Thus in interpreting the H-T-P’s of children of elementary school age we 
must take cognizance of the sex differences at the various age levels. For example, 
one would be cautious in interpreting homosexual trends in a six-year-old boy who 
drew a female person. Before one is justified in making such an interpretation, there 
should be evidence in the case history of a father-son or mother-son relationship 
that would be conducive to the development of a homosexual personality. Also, 
such parental conflict should show up in the H-T-P. The same precaution needs to 
be taken in the case of an eleven or twelve-year-old girl who might draw a male 
person. Actually, of course, a competent clinician would always take all the facts 
into consideration before interpreting any one aspect of the H-T-P. 


CONCLUSIONS 
1. Buck’s hypothesis that subjects tend to draw persons of their own sex 
applies to Illinois school children ranging in age from 5 through 12. In all probability 
this generalization is applicable to all children. 


bs 
-- 


It is also suggested that Buck’s hypothesis that the sex of the drawn person 
tends to represent the felt sex role of the subject is applicable to children. 


3. One has to be particularly cautious in evaluating the significance of drawn 
persons of the opposite sex among 5, 6, and 7-year-old males and among 11 and 
12-year-old females since our results suggest that such trends are typical at these age 


levels. 
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1948. 
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THE CORNELL MEDICAL INDEX—HEALTH QUESTIONNAIRE. III, 
THE EVALUATION OF EMOTIONAL DISTURBANCES* 


KEEVE BRODMAN, M. D., ALBERT J. ERDMANN, JR., M. D., IRVING LORGE, PH.D., 
CHARLES P. GERSHENSON, PH.D., AND HAROLD G. WOLFF, M. D., WITH THE 
TECHNICAL ASSISTANCE OF BARBARA CAPLES 


DESCRIPTION OF THE HEALTH QUESTIONNAIRE 


The Cornell Medical Index-Health Questionnaire (abbreviated C. M. I.) is a 
four page letter size sheet on which are printed 195 questions corresponding closely 
to those usually asked in a detailed and comprehensive medical interview including 
many on the psychological aspects of illness. Patients answer questions on the 
C. M. I as accurately and as honestly as they do to physicians in oral interview “). 
Interpretation of the C. M. I. yields correct psychiatric ®) and comprehensive 
medical“? appraisals of the patient. This paper describes the use of these data in 
appraising the severity of emotional disturbances ranging from minor and transient 
disturbances of mood or feeling pattern to more serious neuroses, psychoses and 
character disorders. 


Subjects. Five samples were examined. The New York Hospital sample (5,121) 
consists of all patients newly admitted during the year beginning July 1, 1948 to the 
medical and surgical out-patient departments, and who completed C. M. 1.’s. The 
New York Hospital is a teaching general hospital for Cornell University Medical 
College students. The “neurotic”? New York Hospital sample (526) are those 
patients of the above sample who were diagnosed as emotionally disturbed during 
hospital examination. 

The New York City ‘‘normals’’ (610) are random New York City men and 
women, ostensibly healthy, who completed C. M. 1.’s. collected by several hundred 
college students in 1949. It is not known how many “normals” are emotionally 
disturbed. The New York Hospital employee ‘“‘normals”’ (459) are those newly em- 
ployed in 1949. The Brooklyn Veterans Administration psychiatric out-patients 
(371) are those treated in the neuro-psychiatric section during July 1948 to March 
1950. All individuals in this sample were men and chiefly between the ages 25 to 
50. About a quarter had been diagnosed schizophrenic, more than half neurotic, and 
the others as having various personality and psychotic disorders. 

All the individuals studied completed C. M. I.’s. Other data were abstracted 
from hospital records. The C.M.I.’s. completed by patients were not available to 
the hospital physicians examining the patients. Various scoring methods were 
studied and compared for differentiating ability. 


RESULTS 

In Table 1 are shown the frequency distribution of ‘“‘Yes’’ responses of the five 
samples. In comparing the number of ‘‘Yes” responses of different samples, scores 
of 50 and 30 were chosen as critical scoring levels; these may be used to compare 
other populations with the ones reported here. Other test scoring methods were 
investigated, using the number of ‘‘Yes” answers to selected questions on the C.M.I. 
For example, one group of questions selected corresponded closely to those on the 
Cornell Index Form N2“. Neither this nor any other test scoring method using 
selected questions was found to be as discriminating between “normal” and ‘‘neu- 
rotic”’ samples, or as consistent, as the use of the entire C.M.I. 


7 rom The New York Hospital and the Departments of Medicine (Neurology) and Psychiatry, 
Cornell University Medical College; and Teachers College, Columbia University. The work described 
in this paper was done under a contract, recommended by the National Research Council, between 
the Veterans Administration and Cornell U niversity. Copies of the Cornell Medical Index-Health 


Questionnaire may be obtained from the authors, The New York Hospital, 525 East 68th Street, 
New York 21, N.Y. 
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TABLE 1. CUMULATIVE FREQUENCIES OF PERCENTAGES OF Eacu Group Havine At LBAsT THE 
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Other manifestations of emotional disturbances on the C.M.I. were studied in 
the New York Hospital patients such as the number of questions answered both 
“Yes” and “No” >not answered at all, and with remarks or changes in the questions 
written in by the patients. The frequency distributions of “Yes” responses for these 
groups are shown in Table 2. 


Tasie 2. THe CumMuLatTiveE FREQUENCY DIsTRIBUTION OF THE NUMBER OF ‘YES’? RESPONSES ON 
rHE C. M. I. ror New York Hosprran MEN anp WoMEN Patients WHo ANSWERED THREE OR 
More Questions Born “YEs” anp “No”, Wao Omirrep ANSWERING Srtx oR More QUEsTIONs, 
AND Wuo AppEep Remarks or CHANGES TO THREE OR More Questions. THe REsuLts ARE Ex- 
PRESSED IN CUMULATIVE PERCEN1 
or “Yrs” RESPONSEs. 
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Tables 1 and 2 show that presumptive evidence on the C.M.I. of the patient 
having an emotional disturbance is: 


(1) Thirty or more “Yes” responses. 


(2) Three or more questions answered both “Yes’’ and “No’’. 
(3) Omitted answers for six or more questions. 
(4) Three or more remarks or changes in the questions. 


Clinical experience was believed to be at least as reliable a criterion for psycho- 
logical interpretation of the C.M.I. as are test scoring techniques. This belief was 
put to a test and found to be correct. Eight individuals without psychological or 
psychiatric training (four medical practitioners, two medical interns, a graduate 
nurse and a technician) made independent clinical interpretations of 191 C.M.I.’s. 
selected at raridom from those completed by patients attending the General Medical 
out-patient department, recording their estimates of the severity of each patient’s 
emotional disturbance as none, mild, moderate or severe. Only one of these, Practi- 
tioner 1, had ever used the C.M.I. before this test, and none knew of the tabulations 
above. 

The validity of their interpretations was tested first by comparing them with 
the psychiatric evaluations made during the patient’s hospital examination. In al- 
most every instance in which a diagnosis of an emotional disturbance had been made 
by a physician in hospital examination, the individuals interpreting the C.M.I. also 
made the diagnosis. With many patients, however, evidence of an emotional dis- 
order was found on the C.M.I. where this possibility had been overlooked in the 
hospital examination. Twenty-five of these patients were recalled for independent 
psychiatric appraisal and 15 returned; the psychiatric interpretations of Practi- 
tioner 1 with the C.M.I. were found to be correct in 13 cases.. The other two were 
foreigners with serious language difficulties. 

The interpretations of the other seven individuals were compared with those of 
Practioner 1, and found to be similar. To indicate the amount of agreement, sixteen 
square tables were developed from which contingency coefficients were computed. 
These values (a contingency coefficient of 0.86 indicates complete agreement) were 
0.68, 0.70, and 0.65 for the three other medical practitioners, 0.69 and 0.66 for the 
two medical interns, 0.65 for the nurse, and 0.67 for the technician. These agree- 
ments are as close as any clinical psychologic or psychiatric estimates are likely to be. 

The interpretations of the eight individuals examining the C.M.1.’s were similar 
to the results of test scoring the C.M.I.’s. For each patient, the interpretations of 
all examiners of the C.M.I. were summed (assigning numerical values to diagnoses of 
emotional disturbances: 0 for none, 1 for mild, 2 for moderate, and 3 for severe) and 
the sum compared to the numberof “Yes” responses on the C.M.L.’s. The two were 
in close agreement, as shown by a coefficient of correlation of 0.83. 


Tue TECHNIQUE OF Usine THE C.M.I. In PsycuraTric INTERVIEW AND 
EVALUATION 

The C.M.I. is used preferably together with oral interview in which the data 
are elaborated to yield qualitative information. The interview is facilitated by using 
the data as leads. \\ en used alone the C.M.I. can be interpreted psychiatrically 
in one minute. 

The most effective method of using the C.M.I has been found for the patient 
to complete it and for the psychologist or physician to review it, before interview is 
started. The interviewer then has available a large body of significant data to aid 
him in making tentative diagnostic interpretations and in conducting the oral inter- 
view. Except when inaccurate answers in the C.M.1. are suspected, only questions 
answered ‘‘ Yes”’ need be elaborated. 
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The form is examined first to determine the number and distribution of “Yes”’ 
answers, and the number and identity of questions answered both “Yes” and ‘‘ No”, 
not answered at all, and with remarks or changes written in. If the ‘Yesses’” are 
chiefly in one or two sections of the C.M.1., a localized body disorder is likely. If 
scattered throughout the C.M.I. an emotional disturbance is to be suspected, espec- 
ially if more than two appear on the last page of the C.M.1I. where are questions 
about the patient’s moods, attitudes and behavior. 

Finally, the most important examination of the C.M.I. is made. The specific 
questions answered ‘Yes’ are noted. This defines the symptomatology of the 
patient’s disorder. No rules can be given here on how to interpret symptoms. Clin- 
ical appraisal of psychological and psychiatric data must be based on the interpreter’s 
own training, knowledge, experience and insight. In any event, the data on the 
C.M.1. are so pertinent and comprehensive that correct clinical psychiatric inter- 
pretations are possible, 


Report oF CASES 


The following case histories illustrate evidences of emotional disturbances on 
the C.M.1. when there are only a few “ Yes” answers. 


Case 1. A disturbance manifest by a few but clinically significant “Yes” answers. A 
50 year old married housewife with one year of college education was admitted to the 
General Medical Out-Patient department with a complaint of severe headaches for 
four months. During hospital examination an emotional disturbance was discovered ; 
no structural disorder was found. 

On the C.M.1. there were only 20 ‘‘ Yes” answers, but the constellation of symp- 
toms Was diagnosed as psychiatrically significant. They included: A history of low 
blood pressure, a poor appetite, frequent flushing of the face, head pressures and 


pain, severe hot flashes and sweats, life looking hopeless, worrying getting her down, 
a family history of a nervous breakdown, and constantly feeling keyed up and jittery. 
Thus, an emotional disturbance was correctly identified by blind clinical appraisal 
of the data on the C.M.I. when the few “Yes” answers formed a psychiatrically sig- 
nificant constellation. 


Case 2. A disturbance manifest by a few but significant “Yes”? answers and many 
questions answered both “ Yes” and ‘‘ No.” A 54 year old widowed housewife sought 
medical advice for ‘terrible weak feelings.’? Her disorder was diagnosed as ‘‘ Anx- 
iety spells in an emotionally dependent individual. Menopause and death of mother 
aggravating factors.” 

The C.M.1. showed 11 ‘‘ Yes” answers, including feeling a choking lump in her 
throat, four complaints of fatigability, and having had a nervous breakdown. In 
addition, 19 questions were answered both “Yes” and ‘No’, the most important 
being spells of severe dizziness, numbness and tingling, severe nervous exhaustion, 
hot flashes and sweats, shaking and trembling, and being considered a nervous per- 
son. Based on the clinical syndrome of the questions answered “Yes’’, or both “Yes’”’ 
and “No”, the interpreters of the C.M.I. diagnosed an emotional disorder. This 
diagnosis, in agreement with that made in the hospital, would have been overlooked 
had only the number of “Yes” answers been considered. 


Case 3. A disturbance manifest by a few but important “Yes” answers and by remarks 
inserted by the patient. A 19 year old single woman, a typist with four years of high 
school education, was admitted to the General Medical Department complaining of 
drooping of the eyelids and difficulty with swallowing since the age of 12. The exam- 
ining physician believed she had schizophrenic tendencies, an impression shared by 
a psychiatrist. 

On the C.M.I. she had indicated only 12 ‘‘Yes’’ answers, but the constellation 
was psychiatrically significant. They included complaints of constant numbness 
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and tingling of parts of her body, twitching of the face, head or shoulders, being 
easily upset and irritated, and inserted remarks to six questions. Without reference 
to scoring levels, the C.M.I. was clinically interpreted as showing evidence of a sig- 
nificant emotional disturbance, an interpretation in keeping with the hospital psy- 
chiatric diagnosis. 


Case 4. A disturbance manifest by “Yes” answers, omitted answers, and inserted 
remarks. A man, a 36 year old hotel employee with two years of high school educa- 
tion, sought medical help for ‘‘a fast heart intermittently for six years and an irregular 
heart for six months.” In the hospital, the structural aspects of his disorder were 
diagnosed as ‘Heart disease, undiagnosed, with recurrent paroxysmal cardiac 
arrhythmias”, and the psychiatric aspects as ‘This patient has all the traits char- 
acteristic of a psychopathic personality.” A physician recorded the relation of his 
attacks of tachycardia to his emotions, chiefly to fear, anger, and anxiety, and pre- 
ceding such activities as sexual intercourse or bouts of drinking. 

On the C.M.I. there were 25 clinically significant ““Yes’’ answers, 12 omitted 
answers, and seven inserted remarks. This constellation was correctly interpreted 
as indicating an emotional disturbance which, since the number of “Yes” answers 
was below 30, would have been overlooked had the concept of critical scoring levels 
alone been used. 


COMMENTS 


The C.M.I. does not make automatic diagnoses. When interpreted, its data 
are useful in evaluating the patient’s psychiatric status. The severity of emotional 
disturbances may be estimated either by clinical appraisal of the data or by using 
the concept of scoring levels. The clinical method is appropriate when the psychia- 
tric status of a particular individual is to be evaluated. It is more discriminating 
than the use of a scoring system. It is rapid, can be used with subjects in a popula- 
tion without previously tested “‘norms’’, provides data that are useful leads in inter- 
view, gives qualitative as well as quantitative information about the patient’s psy- 
chiatric status, and yields accurate estimates in the hands of personnel without 
formal psychological or psychiatric training. When entire populations are to be 
compared, the use of a test scoring technique is indicated. This method, employing 
a standard criterion of the number of “Yes” answers is entirely independent of vary- 
ing skills of different interpreters. 

The C.M.I. has been found useful in the following situations: (1) In hospital 
and private medical practice, to assess the psychiatric aspects of the patient’s total 
medical problem. (2) In industry, to evaluate the employee’s psychiatric status as 
part of his total medical problem, industrial behavior and job placement. (3) In 
schools, for assessing the student’s emotions in relation to his medical status, schol- 
astic behavior and potentialities. (4) In psychiatric and psychologic practice, to aid 
in the rapid evaluation of the patient’s problems and in the establishment of rapport 
in interview. (5) In medical and psychological research, for rapid evaluations of 
each individual’s psychiatric status even when the professional staff is too small to 
interview each individual extensively. (6) In making surveys, to compare the 
probable number of individuals with emotional disturbances in one population with 
the number in others. 

In all these situations, the C.M.I. permits comprehensive general medical“ 
as well as psychiatric evaluations. 


SUMMARY 


The Cornell Medical Index-Health Questionnaire, without the physician’s 
participation, collects a large body of significant medical and psychiatric data for 
diagnostic evaluation. The data can be examined and interpreted rapidly. With 
these data as the only information about a hospital population, it was possible to 
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detect the presence of significant emotional disturbances. Two methods of psy- 
chiatric interpretation of the data on the Health Questionnaire were found to be 
effective: A test scoring technique that counts the number of complaints noted by 
the patient, and clinical appraisal of the constellation of complaints. Both yield 
similar evaluations, those by clinical appraisal being more discriminating. The 
Cornell Medical Index-Health Questionnaire is an appropriate instrument when it 
is necessary to evaluate quickly the psychiatric status of individuals, singly or in 
groups. 


BIBLIOGRAPHY 
BropMan, K., ErpmMann, A. J. gr., Lorce, I., ann Wourr, H. G. The Cornell Medical Index, 
An Adjunct to Medical Interview. J. A. M. A., 1949, 140, 530-534. 
BropMan, K., ErpMann, A. J. gr., Lorar, I., Gersnenson, C. A., anp Wotrr, H. G. The 
Cornell Medical Index-Health Questionnaire. IV. The Recognition Of Emotional Disturbances 
in a General Hospital. J. clin. Psychol. In Press. 
BropMan, K., ErpMann, A. J. yr., Lorae, I., anp Wotrr, H. G. The Cornell Medical Index- 
Health Questionnaire II. As A Diagnostic Instrument. J. A.M. A., 1951, 145, 152-157. 
Weiner, A., BropMan, K., MrrretMann, B., Wecusier, D., AND Wourr, H. G. The Comell 
Index: A Method For Quickly Assaying Personality And Psychosomatic Disturbances, To Be 
Used As An Adjunct To Interview. Psychosomatic Medicine, 1946, 8, 411-413. 





RELIABILITY OF GRAPHIC INDICES IN A PROJECTIVE TEST 
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INTRODUCTION 

Although the Draw-A-Person or DAP test is rather widely used as a clinical 
diagnostic method, it is only recently “> *: ® that efforts have been made to investi- 
gate the relationships among different variables used for diagnostic interpretation 
or to study the normative aspects of various traits. The present study extends this 
approach and is concerned with an investigation of reliability using a rating system 
for a number of graphic traits or variables appearing in the drawings of normal per- 
sons and of studying the consistency of these variables. Several different aspects of 
reliability are considered. 


PROCEDURE 


The drawings used in this study were obtained from 91 persons. Of these 66 
were men and 25 were women. The subjects were all members of courses in psycho- 
logy at the University of California at Los Angeles. Fifty of the subjects (25 men and 
25 women) were in the age range from 18 to 21 (mode =21), and the other 41 men 
werein the age range from 24 to 26 (mode = 25). 

The test procedure consisted of presenting the subjects with a blank piece of 
paper with the instructions, ‘‘ Draw a whole person.”’ Prior to this, reassuring com- 
ments were made that their performance would be used for research purposes only, 
that it would not be judged on artistic merit, and that it was important for each per- 
son to do his or her own work. Six minutes were allowed for this first drawing, after 
which the subjects were told to turn over their sheets of paper and draw a person op- 
posite in sex to the one just drawn. Again six minutes were allowed for the drawing. 
The subjects supplied certain personal data on the sheet of paper, including name or 
initials for identification of the drawing. 

Each of the subjects in this study made two sets of drawings, a male and female 
figure at each administration. The first administration of the test was made during 
the early part of a school semester and the second administration near the end of 
that semester. Thus about four months elapsed between the first and second ad- 
ministration. 

The drawings were rated using the authors’ rating system (described below) in 
order to study the following: 


(1) Consistency of the authors’ ratings on a sample of 90 drawings with re- 
ratings by the authors on the same Sample one week later. 


(2) Inter-rater reliability in the application of the authors’ rating system, which 
involved the use of three raters each independently rating a different sample of 30 
drawings (two samples included male figures only while the third consisted of female 
figures). Each of these sets of ratings was then compared with ratings made by the 
authors on the same drawings. 

(3) Test-retest reliability which involved comparisons of ratings by the avithors 
on two sets of 181 figures by the same persons on two administrations of the test. 
(One female figure was omitted by one subject.) 


DESCRIPTION OF RATING SysTEM 


The rating system used by the authors in this study consists of 21 graphic traits 
or variables in terms of which each drawing was rated. The traits used are as follows: 
Shading, Reinforcement, Erasures, Detail, Body Type, Hands, Eyes, Mouth, Nose, 
Hair, Feet, Breast Lines, Body Detail, Transparency, Extraneous Drawing, Position 
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on Page, Perspective, Symmetry, Position of Hands, Posture, and Lines. The trait 
“Breast Lines” was applied only to female figures. 

Each trait is described by a set of mutually exclusive descriptive categories, 
usually 10 for each trait, in terms of which the ratings were made. To illustrate, the 
variable for describing Shading was categorized as follows: 
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None of the following apply 

No shading 

Hair shaded 

Parts of face shaded 

Chest shaded 

Hands, fingers or forearm shaded 
Parts of elething shaded 

Crotch or genitals shaded 

Any 2 above 

Any 3 above 


th Ooh 
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RESULTS 

The results are presented and discussed in relation to the following three parts 
of the study: (1) Consistency or agreement of the authors’ ratings with their re- 
ratings; (2) agreement of other raters’ ratings with those of the authors; and (3) 
consistency of ratings by the authors on first and second administrations of the test 
to the same persons. All results are stated as per cent of agreement between the sets 
of ratings, i.e. per cent of cases in which the matched ratings are identical. Table 1 
summarizes these results. 


TABLE 1. Per Cent oF AGREEMENT BETWEEN Sets OF RATINGS 


Dimension Re-ratings Compared ratings Test-retest 

(90 drawings (90 drawings) (181 drawings) 
Shading 82. 
Reinforcement 91 
Erasures 86. 
Detail 90. 
Body Type 93. 
Hands 94. 
Eyes 90 
Mouth 87 
Nose 91. 
Hair 70. 
Feet 92 
Breast Lines (female figures) 95. 90 
Body Detail 97. 94. 
Transparency 98.4 93 
Extraneous Drawing 94.4 94 
Position on Page 98.4 94. - 
Perspective 98. 95 
Symmetry 92.3 52 
Position of Hands 91. 88. 
Posture 93.< 93. 
Lines 74.¢ 68. 
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65.6 59. 
73. 64. 
76.6 56. 
80. 78.1 
62.2 92. 
85.5 66. § 
92. 60. 
s+ 43.6 
90. 51.6 
86. 69.6 
90. 
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san percentage for all dimensions 90.: 83.4 





Comparing the variables with the highest agreement in 1 above, the authors’ 
re-ratings, and in 2, the raters’ ratings against the authors’, we find that Perspective 
is highest in both eases, with 98.9, agreement for authors’ re-ratings and 95.6% 
for raters’ ratings with authors’ ratings. Next in order of agreement are Position on 
Page and Transparency, with 98.9°7 and 94.4%, respectively, and 98.9°% and 93.30, 
respectively. Next is Body Detail with 97.8% and 94.4°(, and then Posture with 
93.3°; in both cases. Thus there is not only a high degree of consistency in rating 
these traits, but more remarkable is the fact that they fall in the same order in both 
authors’ re-ratings and raters’ ratings with authors’ ratings. 
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The trait giving the lowest percentage of agreement on the authors’ re-ratings is 
Hair, 70.0°7; second lowest is Lines, 74.4°7; and third, Shading, 82.2°7. For the 
raters’ ratings against authors’ ratings, the lowest percentage of agreement is on the 
Symmetry dimension, 52.29); then, Body Type, 62.27, and lowest, Shading, 65.6 ¢. 

The traits Hair and Lines, with the lowest agreement in the authors’ re-ratings, 
are also lower than most of the traits on the raters’ ratings against authors’ ratings, 
indicating that these traits are low in objectivity and reliability. 

When all ratings made on each sample of 30 figures by the three different raters 
are compared with ratings by the authors, the percentage of agreement of each rater 
with the authors’ ratings is as follows: 87.37, 84.3°7, and 79.7°7. The last figure 
indicates percentage of agreement of one rater with the authors’ ratings on a sample 
of female figures. The difference between this last figure and the others is statistically 
significant at the 5°% level and may well be a reflection of the variability in the pro- 
ficiency of the raters, since no such difference appeared between ratings made on 
male and female figures by the authors. 

The authors obtained a total of 90.5°% on all their ratings, i.e. on all traits 
combined. Eliminating the Hair and Lines traits, which showed lowest percentage 
agreement, raises this figure to 92.4%. 

Comparing combined ratings (1845 in number) of all three raters with ratings 
of the authors gives a percentage of agreement for the totals of 83.4. 

In the comparison of test re-test ratings, 3 above, the highest agreement was on 
the Body Type trait, 92.557. Some of the consistency on this trait may be attributed 
to the fact that a large percentage of the drawings were rated in only two categories, 
which lowered the degrees of freedom for variability. 

The second highest agreement, 78.5%, is found in the Detail variable, in which 
localization and amount are involved. This is of interest in view of the importance 
of detail in the determination of the I.Q. score in the Goodenough Draw-A-Man 
test, and also because detail is frequently mentioned in clinical interpretation. The 
relatively high consistency of this trait tends to substantiate reliability for this 
factor in both the Goodenough and DAP tests. 

Of interest also is the fact that the most variable traits on test re-test are Breast 
lines, with only 42.2°7 agreement; Position on Page, 45.307; Mouth, 43.697; and 
Position of Hands, 45.9°7. The reliability in terms of per cent of agreement on the 
authors’ re-ratings was relatively high for all of these traits. The traits Breast Lines, 
Mouth, and Position of Hands are often mentioned in clinical interpretations of 
drawings, and much significance is attributed to them by several investigators“, 
especially in relation to known sexual disturbances. It is often assumed that more 
difficulty is encountered in representing these features by the subjects because of 
conflicts associated with these parts. 


The per cent of agreement for all ratings on test and retest was 64.1. 


Discussion 

The foregoing results indicate that, given an objective and explicitly formulated 
rating system, a relatively high per cent of agreement may be achieved in evaluating 
the indices commonly utilized in dealing with the DAP test. In cases where re- 
rating or author-rater rating produced comparatively low percentages of agreement 
the difficulty appeared to be in either the complexity of the trait rated or the var- 
iability among raters in assessing the categories of the trait. This points up the need 
for further refinement of the rating system, greater clarification and simplification 
of the more complex category criteria, and longer periods of training for the raters. 

The test re-test reliability was generally lower than rater reliability ; nevertheless 
it indicated that there is a tendency for many of the traits to remain constant over a 
period of months. The more constant traits include both formal and content aspects. 
This fact seems to contradict somewhat the claim of clinicians “) that content aspects 
(e.g. body detail) are less constant than formal aspects of the drawings (i.e. the 
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mechanics of the graphic expression). However, it is‘possible, as we have pointed 
out above, that certain of these formal aspects may be vulnerable to transient dis- 
turbances in the personality. 

Inspection of the paired drawings often revealed rather striking similarities in 
the gross features of the figures. Oakley, noting this similarity in his work with 
adolescents, reported: 

Perhaps the most remarkable feature of the pairs of drawings obtained in this experi- 
ment is the persisteace of the schemas. Four of the pairs of drawings ... show that, after a 
period of two years, the young adolescent reproduces an almost identical drawing. These 
examples are in no way exceptional. 


On the basis of results obtained in this objective exploratory analysis of the 
constancy of the graphic indices in the DAP, attention should now be turned to a 
study of the validity and reliability of the interpretations which are based on these 
indices. The need for such a study is self-evident. No amount of insistence as to the 
meaning of the indices will suffice as a substitute for a rigorous and systematic attack 
on interpretation. And no amount of rigorous and systematic attack on the nature 
of the graphic indices, in vacuo, will shed light on the reliability and validity of their 
meaning. Studies of interpretation are now in progress. 


SUMMARY 


The major purpose of the present study was to determine the consistency of 
sets of ratings on a number of graphic traits in the Draw-A-Person test. This in- 
cluded three parts. (1) A check of the authors’ consistency was made by rating a 
sample of 90 drawings and re-rating this same sample one week later. (2) The inter- 
rater reliability in the application of the authors’ rating system involved the use of 
three different raters each independently rating a sample of 30 drawings, including 


male and female figures. These ratings were then compared with those made by the 
authors. (3) A study of the consistency with which the figures were drawn by the 
same person on the two different administrations, involving comparisons of the 
ratings made by the authors, was made on the two sets of 181 drawings. One female 
figure was omitted by one subject. This experimental design provided indications of 
the sensitivity of the rating system and the variability of drawings by the same per- 
son. A further aim was analysis of the factors which made for variability in ratings. 
The latter information will be useful in revision of the rating system. 

This exploratory study of reliability in a projective technique indicates that a 
high degree of objectivity may be achieved for certain indices commonly used less 
objectively. Analysis of the results suggests that reformulation and further clarifica- 
tion of criteria for rating certain dimensions and further training of the raters might 
result in substantially higher inter-rater agreement. The results obtained are con- 
sidered sufficiently promising to warrant -validity studies. 


{EFERENCES 
Giepr, F. H., and Lenner, G. F. J. Assignment of ages on the Draw-A-Person Test by male 
neuropsychiatric patients. J. Personal., 1951, 19, 440-448. 


Leuner, G. F. J., and Gunperson, E. K. Height relationships on the Draw-A-Person Test, 
(In press.) 


Lenner, G. F. J., and Siitver, H. Age relationships on the Draw-A-Person Test. J. Personal., 
1948, 17, 199-209. 


Macnover, K. Personality Projection in the Drawing of the Human Figure. Charles C. Thomas, 
Springfield, Il., 1949. 


Oakey, C. A. Drawings of a man by adolescents. Brit. J. Psychol., 1940, 31, 37-60. 





A RORSCHACH STUDY OF SIXTY-SIX ADOLESCENTS* 


FRANCES E, KALLSTEDT 


INTRODUCTION 
The purpose of this study was to discover the characteristics of Rorschach re- 
sults in a group of apparently well adjusted adolescents, and to note if their profiles 
show a statistical trend or pattern comparable with other normal and clinical groups, 
such as those studied by the Buhlers and Lefever. 


METHOD 

During a two year period, forty-one girls and twenty-five boys, ages 13-19 in- 
clusive, were tested. They were enrolled in eight different schools from Junior High 
to Junior College, and selected by a teacher or administrator as being of better than 
average emotional adjustment. While a clinical evaluation was not possible, each 
person recommending the student was asked to/check a list of specific questions de- 
scribing behavior, as well as to give a subjective opinion. The group, therefore, 
represented good emotional adjustment from more than casual observation. 

The chief contribution of the Bubler-Lefever publication shows that the Ror- 
schach technique can statistically indicate the degree of emotional stability and 
maladjustment in a given subject. The authors developed ratings of plus and minus 
weights for a total of 108 significant Rorschach symbols. The algebraic sum of these 
weights, for a single Rorschach test, is called the basic Rorschach Score. This score 
is a measure of personality integration on four levels, as follows: 

I. Adequacy BR Score +15 to +30 
Il. Conflict BR Score Oto +15 
Ill. Defect BR Score -15 to 0 
IV. Reality Loss BR Score -30 to -15 

Each group parallels clinical categories as used in psychiatry: persons having 
the greatest impairment appearing in group IV; while normals, and neurotics such 
as anxiety neurosis and hysteria, fall in groups I and II. 

, RESULTS 

Table 1 shows the mean Basie Rorschach scores of these sixty-six adolescents 

compared with the Buhler normal adult groups and student nurse group. 


TABLE 1. Mean Bastc Rorscuacn Scores OF ADOLESCENT AND ADULT GROUPS 


Groups 


Mean Basic Rorschach 
seores 


Adults: Normals I and IT (highly selected) 21.4 
Normals ITI 14.4 
Nurse students, ages 19-20 11.8 
Group I The total sixty-six adolescents 9.1 
Group II 28 males and females, age 16-19 10.8 
Group III 38 males and females, age 13-15 7.3 
Group IV 25 males 7.9 
Group V 41 females 3 
Group VI 9 males, age 16-19 9.5 
Group VII 16 males, age 13-15 6.3 
Group VIII 19 females, age 16-19 12.2 
Group IX 22 females, age 13-15 8.3 


+e 








*The original study, of which this article is a brief resume, was submitted for research credit 
in the Department of Educational Psychology, The University of Southern California, spring sem- 
ester, 1951. It contains the profiles and charts showing standard deviations. Grateful acknowledge- 
ment is made to Dr. C. Buhler for her interest and encouragement, to Dr. D. W. Lefever for checking 
the statistical data, and to the schools that cooperated to make the testing possible. 
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Table 2 presents the mean values of Rorschach symbols for the various group- 
ings of adolescents. It is apparent that the BR Score decreases with the decrease in 
age; furthermore, that these scores fall in the Level of Conflict. The oldest females 
have the best integrated emotional adjustments, the youngest males the poorest. 


Tapsie 2. Mean VaLues or RorscHACH SYMBOLS FOR NINE Groups OF ADOLESCENTS 


R* | M FM om “«k+K FK F Fe 


Group I 41.6 6.3 10. 3 . 1. 
Group II 42. 6.9 10.- 
Group IIT 41.2; 6.0 10.3 
| 4.4 9.8 

7.6 
Group VI 40.3} 3.7 10-6 
4. 


bo 


w 


Group IV 40 
Group V 42.: 3. 
3.! 


— — 


NonwnNw Ww te 


~~ 


wot 


Group VIII 42.9) 8.4 9.6 


nw 


4. 
Group VII 39.8 9 9.¢ 3.2 2 

3 3.5 
Group IX 42.2} 6.9 10.8 1. 2:4 2:.! 


— 
_ 


3.7 

*Nore: The differences between R and the actual sum of the symbols M to C, inclusive, are due to 
the fact that the means of M to C include the value of the additional responses, while R im- 
dicates main responses. 


P W% D% 


07 oO 
F% A% 
Score 





Group I  -& a 4 4. 5.9 20. S51. 33.4 41. = 9. 
Group II ’ 5. 2: .¢ 4.6 4.4 20. .. mae 9. 4° 10. 
Group II . 3.8 f . 3.§ 5.8 19. Si. 36.0 41. 73 
Group IV ) 4.% 2.4 at 4. 5. 18. 51. 

Group V Dd 8 sq 4.5 d. 20. 28. 

Group VI 2.3 4.: 2.- 4. >. 19. 54. 

Group VII os : 4 4.3 5.5 18. 

Group VIII ¢ ) 8 5.4 2 2. 56. 

Group IX : 3.! 8 a 3.8 . 20. i 37. 


& When profiles are constructed for each group of adolescents they resemble the 
structure of the neurotic adult. In Rorschach terms, both have more FM, more m, 
more c than the well adjusted adult, i.e., greater degrees of unsatisfied instinctual 
need, greater tensions, greater social and sexual insecurity. They parallel the neu- 
rotic, in having a greater amount of introversion (M>Sum C), especially true of the 
younger adolescents who are confused and lack inner organization (low W,). 
Adults have better self-determination (more M) than adolescents. 

Examining the males (Group IV, VI, VII) we see that unsatisfied instinctual 
drive (FM > M) is outstanding in the older group (10.9:3.7), which ratio is con- 
siderably more acute than at the younger level (9.3:4.9); and more pronounced than 
in any other of the profiles. The older group suffers from greater tension (m = 4.4 
vs. 3.2), slightly greater anxiety (k-+ IX) ; but somewhat less self-consciousness (FK). 
The older males have the better contactual ability (Fe), but their social and sexual 
insecurity (c) is more intense. 

The girls’ most difficult developmental problems, as seen by the FM:M ratio, 
occur in the 13-16 year old period. The highest Basic Rorschach score (12.2) belongs 
to the females, age 16-19. Their extraversion-introversion balance is very favorable 
(M:SC = 8.4:4.8); the control over goals also acceptable (M:FM = 8.4:9.6). Their 
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mean scores (Group VIII) show a favorable control of emotionality (FC > CF+C), 
but the amount of CF+C existing at the older level emphasizes the fact that much 
of the uncontrolled emotionality of the young girl is still true at the upper level. 
On the other hand, the older girl is less constricted and rigid in her reality awareness, 
indicated by the lower F°;—i.e., 27) rather than 37%. She has developed greater 
contactual ability, Fe having increased from 2.4 to 3.5. 

Klopfer ©? defines the factor ¢ as the groping for contact and sensuality, as con- 
trasted with the social factor of tact and sensitivity, represented by Fc. This sign 
(c), Buhler calls ‘‘the sign of social and sexual insecurity.”” Both descriptions of ¢ 
are illustrated in all of the adolescent profiles, but are most pronounced for the older 
females. Their mean value of ¢ shows a change from 2.5 to 3.3, describing that in- 
creased concern regarding social and sexual insecurity very characteristic of the girl 
approaching womanhood. Her apprehensiveness and selfconsciousness (FIX), 
‘caution (C’), and concealed inner resentment (S) have also increased with age. 

A third approach in analyzing these groups was made by making a statistical 
comparison of sign agreement and differences between the normal adult group and 
the adolescents, in the same manner as was done in the Buhler-Lefever study 
(1, Chapter 15)" Compared in this way, the most jsignificant differences and similari- 
ties are presented in table 3. 


TABLE 3. SraristTicaL Sign DistriputTion CoMPARING PERCENT OF APPEARANCE 
in ADULTS AND ADOLESCENTS 





Buhler Definition Percent of 
Sign of Appearance Brief 
Number Sign Adults Adolescents Interpretation 





Significantly different results for adolescents and adults: 


10 M>FM 60% 3% Adolescents have more unsatisfied 
instinctual needs. 
8 M=4+ 93 58 Adults have better self-determina- 
tion. 
31 c>Fe 11 48 Adults — better contactual ability. 
32 e=3+ 3 39 Adolescents have greater insecur- 
ity. 
47 =C<M 40 30 Adolescents—more introversive. 
48 W =20-% 30 67 Adolescents—more confused, lack- 
ing inner organization. 
34 C’ present 97 85 Adults—more cautious. 
64 S$=3+ 41 18 Adults—more concealed resent- 
ment. 
66 A%=21-45 90 70 Adults—more resourceful. 
72 P=5 70 90 Adolescents—more aware popular 
point of view. 
Equally represented signs in normals and adolescents: 
98 98 Similar responsiveness. 
80 90 Favorable contactual ability. 
70 70 Reality awareness. 
70 70 - Emotional control. 
63 60 Anxiety tolerance. 
48 Concealed inner resentment. 


Equally rare signs in normals and adolescents: 


1 R = 24- 2 Unresponsiveness. 

5 M=0-1 0 Lacking self-determination. 
15 RFK =3+ 15 I-qually low apprehension. 
25,26 F%=51+ 0 Constricted reality awareness. 
27 Fe absent 0 Poor contactual ability. 

33 C’ absent 3 Lacking caution. 
36 FC absent 0 No emotional control. 
73 F- = 1-2 8 Poor perception. 
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The problem of standards for the evaluation of Rorschachs of children and 
adolescents has been raised and studied repeatedly &!. The opinion prevails that 
adult standards cannot be transferred to children’s Rorschachs, but no definite 
conclusions have as yet been reached as to whether or not children’s and adolescents’ 
Rorschachs form definite patterns and what the pattern characteristics are. 


SUMMARY AND CONCLUSIONS 


In this study, 13-19 year old selected well adjusted adolescents have been found 
to show certain definite trends in their Rorschach patterns. They show significantly 
less M, more FM>M, more m, more ec, e>Fe, and more low W%, than adults. 
That is to say, their Rorschachs give evidence of their lack of direction, their strong 
unsatisfied instinctual needs, their tensions, social and sexual insecurity, and their 
lack of inner organization. These are the most outstanding and psychologically 
well-characterising trends of adolescents’ Rorschachs. 

Although the described pattern appears with statistically significant frequency, 
the individual Rorschachs of this adolescent group show many individual differences. 
The variety appears greater than in normal adult Rorschachs. This impression will 
be investigated further with a larger sample. 


. 


BIBLIOGRAPHY 
Bouncer, C., Bunver, K. and Lerever, D. W. Rorschach Standardization Studies I: Development 
of the Basic Rorschach Score. Mimeographed private printing, Los Angeles, 1948. 
KLoprer, Bruno, and Keiity, DouGias M., The Rorschach Technique. Yonkers: World Book 
Company. 
Davipson, Heten H., and Kioprer, B. Rorschach Statistics. Part IT, Normal Children. 
Rorschach Research Exchange , 1938, 3, 37-43. 
Forp, Mary, The Application of the Rorschach Test to Young Children. Minneapolis: University 
of Minnesota Press, 1946. 
Hertz, Marauerite H. Rorschach Norms for an Adolescent Group. Child Development, 
1935, 6, 69-76 
Kerr, M. The Rorschach Test Applied to Children. Brit. J. Psychol., 1934, 25, 170-185. 
Kioprer, B. and Marcuties, H. Rorschach Reactions in Early Childhood. Rorschach Research 
Exchange, 1941, 5, 1-25. 
ScHACHTEL, ANNA H., The Rorschach Test with Young Children. Amer. J. Orthopsychiat., 1944, 
14, 1-10. 
SuNNE, Daany. Rorsehach Test Norms of Young Children. Child Development, 1936, 7, 304-313. 


Voruaus, Pavtine G. Rorschach Reactions in Early Childhood. Part III, Content and Details 
in Pre-School Records. Rorschach Research Exchange, 1944, 8, 71-91. 





4 


STANDARDIZATION OF A SHORT FORM OF THE WECHSLER 
VOCABULARY SUBTEST* 


MELVILLE FINKELSTEIN, RENATE GERBOTH, AND RUTH WESTERHOLD 
Alton State Hospital 


INTRODUCTION 

One of the major problems confronting the clinician using the Wechsler Scale 
is the length of time involved in its administration. For an experienced clinician to 
administer the entire battery of subtests, it usually takes from 60 to 90 minutes de- 
pending upon variable factors in each individual case. Many studies have been con- 
cerned with reducing the time factor while retaining a sufficiently high reliability 
to warrant acceptance of results. The bulk of these studies, such as McNemar®?, 
Patterson“ ©, Rabin ©, and Hunt et al® have been designed to obtain reliable 
“short forms” of the Wechsler Scales. Though some success has been reported, if 
such short forms are adopted, much valuable and varied material from a qualitative 
viewpoint will have been frequently lost to the clinician working with disturbed 
individuals. The time element is of primary concern to the clinician testing psy- 
chotic patients who are hostile, distractible or lacking in sufficient control, contact 
or psychic energy to maintain their participation for any length of time. The prob- 
lem then becomes one of eliminating or reducing parts of subtests while retaining as 
complete a qualitative and quantitative picture as is possible. 

The most time consuming of all subtests is the Vocabulary Test. Its yield, how- 
ever, is great. It correlates highly with the entire scale and with intelligence as a 
whole“, It is the most stable of the subtests“ and affords a wealth of qualitative 
material useful in determining whether and what kind of changes have taken place 
in the thinking and communicative processes of the patient. Any attempts at short- 
ening would of course lessen the qualitative picture to some extent. Yet we feel that 
this would be far more compensated by avoidance of or reductions in adverse effects 
upon subjects caused by tests both lengthy and demanding concentration. In addi- 
tion time is saved. It (shortening) would aid if proven reliable in the formation of a 
condensed yet highly effective battery. 

Gilhooly “?, using the split half method in four subtests with neurotic subjects 
found the vocabulary test to have a high reliability from sample to sample and a 
split half correlation of .94. 

Webb and De Haan“ also used the split half method in attempting an estim- 
ate of reliability. Their results for normal and paranoid schizophrenic subjects on 
the Vocabulary subtest were correlations of .88 and .83 and reliabilities of .94 and .91 
respectively. The homogeneity of these groups makes the very promising results 
only locally applicable. The purpose of this study shall be, by the use of the split 
half method, to determine whether half of the Vocabulary subtest would be as re- 
liable as the entire list for a general hospital population. 


PROCEDURE 

The data used were the vocabulary test responses from the Wechsler Bellevue 
Scale Form I from 353 hospitalized patients unselected either for age, classification 
or intelligence, the records being supplied by the Jacksonville State Hospital“ and 
Alton State Hospital. Test records were all rescored by two workers whose scoring 
methods were judged to be most nearly alike. The scoring of all questionable re- 
sponses was agreed upon by both workers. The vocabulary list was split into halves, 
the odd words composing one half, the even words the other half. Raw scores on 
the two lists were calculated for each record, and recorded in one of the eight age 


*The authors wish to express their appreciation to Dr. Alvin Winder, Dr. Sheldon Rappaport and 
Mr. Harold Smolinsky for their help and comments. 


“) The authors wish to express their appreciation to the Jacksonville State Hospital, for their help 
in sending us additional data. 
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groups devised and used by Wechsler, each covering five years. The total age range 
was 20 - 60, and the number in each group ranged from 32 - 59. 

The raw scores on the alternate word lists were then correlated, the first time 
for each age group and secondly for the total population using the Pearson product- 
moment correlation method. The Spearman-Brown formula for estimating reliabil- 
ity from two comparable halves of a test was also utilized. Probable errors of the 
correlation were also computed, in addition to means and standard deviations of 
each group. 

RESULTS 

Means of the raw scores of odd and even items (Table 1) range between 9.09 
and 10.72 with means of the entire group being 9.85 and 9.95. This would indicate 
that the estimated verbal 1.Q. of the sample population is 99, somewhat higher than 
might have been expected in a hospitalized group. However since Vocabulary is 
considered by Wechsler and Rapaport to be one of the subtests that remains relative- 

Tas_e 1. MEANS AND STaNpARD DEVIATIONS FoR Eacn HALF 
OF THE VOCABULARY SuBreEst FoR E1cur Ace Groups 
AND THE ToTaL SAMPLE 


Odd Items Even Items 


he Group | Means 8.D. Ricans S.D. 
20 24 7 - 9.09 4.15 9.3- : 06 
25 — 29 3: 9.61 3.38 9.3 

34 56 10. 10. 4- 

= 39 10.36 10.36 

- 44 | § 1 10. 

- 49 46 9 

- 54 B® 8.51 

- 59 36 9.35 


Total 353 9 85 


ly unchanged, we can accept the somewhat higher scores as being representative. 
Similarly, the standard deviations ranging from 3.33 to 4.95 are, in general, some- 
what higher than would be expected in a normal population. The heterogeniety of 
our hospital population and the consequent wide range of vocabulary quotients 
has tended to enlarge deviations from the means. 

Inspection of the data in Table 2 reveals that all correlations fall between .89 
and .96. The highest correlation, .96, is found in the 25 - 29 year age group. This is 
followed by a correlation of .93 in the 50 - 54 age group. A correlation of .91 occurs 
in three age groups, 30 - 34, 40 - 44 and 45 - 49. A correlation of .90 is found in the 
35 - 39 and 55 - 59 year groups. The lowest correlation of any age group, .89, occurs 
in the 20 - 24 year group. The correlation for the entire population is, as expected 
lower than the smaller groups, .89. All correlations are significant at the 1°; level of 
confidence. None of the probable errors of the correlations exceeds .02. 

Use of the Spearman-Brown formula shows that the estimated reliability of the 
split half vocabulary of the age groups range from .93 to .98. Therefore it appears 
that relatively little loss in reliability is involved in the use of either of the alternate 
word lists as compared to the use of the whole list. The results from our larger and 
more heterogeneous population, agree substantially with both studies previously 
cited. 
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TABLE 2. CORRELATIONS OF THE SuBTEST HALVEs, PROBABLE 
ERRORS OF THE CORRELATIONS AND EstiMaTED RELIABILITY 
CoeFFICIENTS. ALL CORRELATIONS SIGNIFICANT AT 
THE 1% LeveL or CONFIDENCE. 


Age Group | Correlation | Prob. Error | Reliabilities 


20 - 2 .89 .02 93 
O01 
Ol 
.O1 


.O1 


Total 


CONCLUSION 

A split-half reliability study was undertaken to determine the feasibility of 
using a short form of the Wechsler-Bellevue vocabulary subtest for a general hos- 
pital population. The vocabulary raw scores from the records of 353 unselected 
hospitalized patients were subjected to statistical measure and evaluation. Data 
were recorded for the 8 age groups devised by Wechsler, each covering five years. 
Correlations ranged from .89 to .96 with an overall correlation of .89 and a maximal 
range of .87 - .97. All correlations were found to be significant at the 1°% level of 
confidence. Correction by the Spearman-Brown formula yields a reliability co- 
efficient of .93 for the total sample and .93 to .98 for the various age groups sig- 
nifying that deviation from the full list is minimal. Therefore when used with hos- 
pitalized patients it appears that relatively little loss in reliability is involved in the 
use of either of the alternate word lists as a substitute for the full vocabulary list. 
However, the complete reliability of this method depends upon further research 
with large homogeneous samples of the various hospital and clinical entities. 
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THE FULL-RANGE PICTURE VOCABULARY TEST: 
Vill. A NORMATIVE STUDY OF NEGRO CHILDREN* 


NEIL W. COPPINGER AND R. B. AMMONS 


Tulane University University of Louisville 


INTRODUCTION 


The Full-Range Picture Vocabulary Test was developed to meet the need for a 
simple test of verbal comprehension ability which could be quickly administered and 
could be used with physically handicapped individuals as well as normals. It was 
standardized and evaluated by Ammons, et al, ©: *: > ® 7 on a sample of 600 whites, 
controlled for age, sex, and socio-economic status. Administration time averaged 10 
minutes. Interform correlation varied with range of talent, but usually was found 
to be in the .90’s. A consistent rise of mean scores with age, and substantial correla- 
tions with 1937 Stanford-Binet and Wechsler vocabulary subtest scores indicated a 
satisfactory validity. 

Intelligence tests have frequently been criticized as being culture—or class— 
bound. The major individual intelligence tests have norms only for a white popula- 
tion. Neither the Wechsler“ nor the 1937 Stanford-Binet “*) standardization 
group contained Negroes. A special study run by Ammons and Agiiero “ on Spanish- 
American children indicated that these children were unfairly penalized when the 
white norms were used to evaluate their performance. Investigations have shown 
Southern Negroes to belong in a special sub-culture and to be penalized by ordinary 
intelligence tests &> 1%, 


PROBLEM 


The purpose of this study was to evaluate the reliability and validity of the 
Full-Range Picture Vocabulary Test for measuring the verbal comprehension ability 
of a representative sample of Southern Negro school children, and to set up norms 
for Negro children based upon this group’s scores. 


OBSERVATIONAL ARRANGEMENTS 


Materials. Full-Range Picture Vocabulary (PV) testing was done with the 16 plates 
developed by Ammons and Huth®. Each plate measured 8% by 11 in. and con- 
tained four separate cartoon-like line drawings of common objects, human activities, 
or familiar scenes. The 85 items in each of the two forms (PV-A and PV-B) were 
mimeographed on an answer sheet, in order of difficulty by plates. The 1937 Stan- 
ford-Binet vocabulary (SB-V) words“ were listed on a special mimeographed form 
allowing enough space for verbatim recording of answers. 


Population. A total of 80 Negro schoo! children from Orleans and Caddo parishes’, 
Louisiana, were tested. There were 40 boys and 40 girls, approximately equal num- 
bers from rural and urban areas. Five girls and five boys were tested at each grade 
level, one through eight. Children were in the “correct” grade for their age according 
to the school system. In a few cases where no such child also meeting the sex and 
parental occupation criteria could be found, subjects were substituted who had never 
failed a grade although they had started late in school and thus were over-age. Ages 
and age variabilities are summarized in Table 1. 


*The test plates and a preliminary manual with final scale norms, sample answer sheets, and 
instructions for administration ean be obtained from R. B. Ammons. 

1The authors wish to express their appreciation to Miss Carmelite Janvier, Director of Special 
Service, Orleans Parish School Board, M. C. Johns, Assistant Superintendent, Caddo Parish School 
Board, and the teachers whose pupils served as subjects. 
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TasLe 1. THe MEANS AND STANDARD Deviations, IN YEARS, OF THE CHRONOLOGICAL AGES OF THB 
Supsects, FoR Each Grape LEVEL 








| 
Number Mean Standard 
Tested Chronological Deviation 
Age of CA’s 
(Years) | (Years) 


6. | 


10 3.6 “32 





Occupations of fathers were obtained from school records, and a sample of 
children chosen so that the occupations of their fathers would be representative of 
those of all Southern Negro males. The procedure foliowed in setting up the occupa- 
tional sampling criteria was as follows: 


The major occupation groups, occupations included under each major group 
and percentage of Southern Negro males employed in the respective groups were 
taken from the sixteenth census of the United States, 1940°. Using these per- 
centages and a total sample N of 80, the number of subjects needed to represent 
each occupation group was computed. It was necessary to combine some of the oc- 
cupation groups because of the low percentage of Southern Negro males employed 
in those groups. The protective service workers (0.3 per cent) were included with 
the domestic service workers. Occupations not reported (0.7 per cent) were included 
with the laborers. Farmers, farm managers, farm foremen, and farm laborers were 
all included in the group of farmers and farm laborers. 

Table 2 summarizes the occupational sampling information. Once quotas were 
set up, each was distributed as evenly as possible among the sex and grade groups, 
within the limits of the numbers in the particular subsample. Subjects were selected 
in rotation from an alphabetized list of the names of all children available in a given 
school and grade, in such a way as to avoid any systematic sampling bias. 


TasLe 2. Troe Occupationat Groups (A) Are Tuose Listep IN THE 1940 Crnsus()*. THe 
CENsus GIVEs THE PERCENT DiIsTRIBUTION BY Mason OccuPATIONAL GROUPS FOR EMPLOYED SouUTH- 
ERN NEGRO MA ss (B). THe PercENT DISTRIBUTION OF THE PRESENT SAMPLE Is GrvEN (C) By 
OccuPATIONAL Groups. AcTUAL NUMBERs TEsTED (D) ArE INDICATED. 











(A) Be! | (B) (C) | (D) 
Occupational Census | Per cent Number 
Group Per Cent | Tested | Tested 





9 


Professional and Semi - Professional 2 
39 
3¢ 


Farmers and Farm Laborers 48.6 
Proprietors, Managers, and Officials ; 
Clerical, Sales, and kindred Workers | 
Craftsmen, Foremen, and kindred Workers 3.6 
Operatives and kindred Workers 10.6 
Domestic Service Workers 3. 
Service Workers, except Domestic | 8. 
Laborers, except farm and mine 21. 


pmwwwu 


~~ 


Smet 


~ 
— a | 


Total 99.2 





*Information was taken from Tables 58 and 64, pp. 75-80, 97. 
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Procedure. All children were given the 1937 Stanford-Binet vocabulary (SB-V) test 
and Forms A and B of the PV test. In administration, the two forms of the PV test 
were separated by the SB-V. The order of the PV forms was reversed with every 
other child. Half of the children were given Form A first, while the others were given 
Form B first. Children were tested individually during school hours in space pro- 
vided by the schools. The tester was a white adult male. Testing lasted from 15 to 
50 minutes, with older children taking longer than younger ones. 

When the child entered the room, he or she was engaged in friendly conversa- 
tion. When rapport seemed satisfactory, testing was begun. Standard procedure “? 
was followed with the PV test. The child was asked to point to the picture which 
meant the same as the word used by the tester, and several sample words were tried 
until the child was judged to understand the procedure. He was then tested with the 
words for each plate until all had been exhausted, or three had been passed or failed 
consecutively. Special attention was paid to the problem of guessing. As a part of 
the instructions, each child was told not to guess but to say “I don’t know.”” During 
testing, when signs of guessing appeared, he was asked to define the word, or asked 
the same item again later in the test. Easy items were slipped in to keep up his 
morale. This procedure seemed to minimize guessing. There were no direct refusals 
to answer. 

The SB-V was given and scored according to the standard instructions in Ter- 
man and Merrill’s book “??. 


RESULTS 

Each subject received three scores, each of which was the sum of the correct 
answers on a test, SB-V, PV-A or PV-B. The means and standard deviations of the 
raw scores, by CA and grade-level, for Forms A and B of the PV, are summarized 
in Table 3 which also includes white population norms®? by age-levels, and pro- 
posed negro norms. An inspection of this table shows no consistent difference be- 
tween the mean scores by age-levels on the two forms of the test. It does show a 
satisfactory progression of mean scores from age-level to age-level. The only inver- 
sion of means occurs at the 11-year level. 


TasB._e 3. Tue Mean Raw Scores anp S. D.’s on Forms A (C) ano B (D) oF THE FuLL-RANGE 
Picture VocaBuLary Test FoR NEGRO Sussects, By AGE (A) anp Grape (B) LEVELS. SMOOTHED 
AVERAGES FOR Boro ForMs COMBINED FOR WHITE SuBJeEcts (F) Are INpicarep, As ARE Proposep 
NEGRO Norms (BE). 





(A) (B) (C) (D) (KE) (F) 
AVERAGE FORM A FORM B Proposed White 
AGE - Negro | Population 
(years) Grade Mean 8.D. Mean 8.D. | Norms Norms* 





5 19 
20 
25. 
29.4 
32. 
30. 
40.2 
43.9 


v.09 18 
21 
25 
29 
33 
36 
40 
44 
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6 
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“Estimated from (5) for these ages. 


The Negro sample failed to obtain scores as large as those of the white popula- 
tion at corresponding age-levels. In most instances the scores for the present sample 
approximate those listed for whites who are two years their junior; e.g., the mean 
score for the Negro sample at the 9-year level is 27.5 while the score for the white 
population at the 7-year level is 27. In contrast these children were from 3 to 5 years 
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retarded on the SB-V and rapport was much harder to maintain. Performance of 
this group on the SB-V is analyzed in detail elsewhere. 


The adequacy of the PV for use with Negro children can be at least partially 
evaluated from the following findings in the study: 


(a) Reliability. For the whole range of talent, grades 1 through 8, raw scores 
on the two forms correlated r = +.96 (N = 80 children, standard error of measure- 
ment of about one word or 3 months mental age). 


(b) Validity. PV-A and PV-B scores correlated r = +.84 andr = +.81 
(N = 80 children) respectively with scores on the SB-V. Item difficulties (estimated 
age at which 50 per cent pass, see™ were calculated, and were correlated? with item 
difficulties similarly caleulated for the white normative population“: ®), PV-A and 
PV-B difficulties for white and Negro children correlated r = +.96 andr = +.90 
respectively, which indicates that the items had about the same order of difficulty 
for the two’ groups. 

The fairly stable age progression in mean scores has already been noted, as has 
the close correspondence between mean scores on the two forms at the various age 
levels. 


(ec) Sex differences. At no age level was there a difference approaching statisti- 
cal significance between the mean score for boys and the mean score for girls, on 
either form of the PV. Item difficulties for the 41 items on the PV-A and the 36 on 
the PV-B, whose 50 per cent passing point for both sexes combined fell within the 
6.0 to 14.0 age range, were calculated separately for males and females. For PV-A 
the correlation between difficulty levels for the two sexes was r = +.97, and for 
PV-B, r = +.94, indicating that there was little difference in item difficulty 
associated with sex. 


(d) Urban-rural differences. Inspection of the data showed that after age nine 
there was an increasing difference between the mean scores of the rural and urban 
groups. This difference increased to about eleven words at age thirteen, with the 
rural children about two and a half years behind the urban in mean scores. It is 
interesting that these urban Negro children’s mean scores were nearly as high as 
those of the white standardization group. 


(e) Practice effect. Each child was tested with both PV forms, half with one 
form first and half with the other. There was no appreciable practice effect detect- 
able by inspection or statistical test. 


DIscUSSION 


Any analysis of the PV test must consider the fact that it is more than a test of 
vocabulary; it tests verbal comprehension and the ability to communicate. Negroes, 
particularly Southern Negroes, live in a different cultural environment from that of 
whites. Perhaps one should use special test norms when making comparisons within 
the Negro group, although use of common norms with whites can be justified where 
Negroes and whites are shown to have the same culture or to have:adopted white 
cultural standards. 

It is possible that the present group of Negro children is more highly selected at 
the upper age levels than the white normative group, due to a larger number of school 
failures and drop-outs. In any case, although the sample is small, the fact that urban 
Negro scores were nearly as high as white normative scores should be thought-pro- 
voking. Perhaps with similar cultural experiences, differences in the intelligence test 
scores of white and Negro children would disappear. 


*Those 41 items on PV-A and 36 on PV-B were used whose 50 per cent passing point fell within 
the age range 6.0 to 14.0 years. 
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SUMMARY AND CONCLUSIONS 


Eighty Negro children in grades one through eight were tested with both forms 
of the Full-Range Picture Vocabulary Test and the vocabulary test from Form L of 
the 1937 Stanford-Binet. The sample was stratified with respect to grade-placement, 
age, sex, and parental occupation. Scores on the two forms correlated + .96 with 
each other and + .81 and + .84 with Binet vocabulary scores. There was no de- 
tectable practice effect. On the basis of these findings, and the results of various an- 
alyses of item difficulty, the authors conclude that the Full-Range Picture Vocabu- 
lary Test is suitable for use with Negro children. Negro norms are given, based on 
the scores of the children in the present sample. 
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INTRODUCTION 


Psychological tests which yield scores on more than one trait present problems 
of interpretation since there is usually no simple formula for combining the series 
of scores to make interpretive judgments. A ‘‘one-trait-at-a-time” technique of 
interpretation is inadequate for this discrete approach fails to take into account 
the fact that the interpretation of one score may be modified by the score on a second 
scale. Thus a clinician should not attempt to evaluate 4 M, an A on aviator or 
a low score on Block Design without taking into account the other scores of the test 
concerned as well as other data that may be available on the client. The interpreta- 
tion of the results of multi-score tests has been facilitated by the use of profiles 
which quickly suggest the scores on which an individual is abnormally high or low. 
Out of experience with profiles the clinician develops the habit of looking for a certain 
pattern to answer specific questions or of making his discriminating or diagnostic 
statements on the basis of the pattern he detects. A particular pattern may be gen- 
erally accepted and validated among many clinicians or it may be a more or less 
unformulated result of the clinician’s own experience with a certain group of clients. 

Profiling and the subsequent interpretation of differences between scores is de- 
ceiving when it fails to emphasize the unreliability of these differences. The reliabil- 
ity of the difference between scores on two scales decreases with an increase of cor- 
relation between the scales and decreases with lower reliability of the scales them- 
selves. This source of error is particularly important with multi-score personality 
tests whose scales are of relatively low reliability and, with exception of scales de- 
rived by factor analytic techniques, of high intercorrelation. These considerations 
of the significance of differences are applicable not only to scores on questionnaires 
but also to the indices derived from such unstructured tests as the Rorschach. The 
low reliability of differences implies that we must seek some method of identifying 
patterns which takes into account the fact that, while the true scores of several in- 
dividuals may follow the same pattern, their obtained scores will vary within a 
certain range of patterns. 

The Minnesota Multiphasic Personality Inventory (MMPI) yields scores on 
nine clinical and four validating scales. With the earliest publications © the authors 
recommended that interpretation should be based on the profile pattern rather than 
individual scores. There are two methods of arriving at statements concerning the 
implication of a certain pattern. Schmidt“, Gough®’, Hovey, Guthrie“, and 
others have found certain profile patterns associated with diagnostic groups. An 
alternative method is grouping patients according to similar profiles and studying 
the groups to find common features of persons having a given profile pattern. Hatha- 
way “) has proposed grouping profiles by the two highest clinical scales. Each scale 
is numbered so that Hs is 1, D is 2 through Ma which is, of course, 9. In this way a 
profile with Hs highest and Ma second highest is said to have a 19 (one, nine) pat- 
tern. The entire profile according to Hathaway’s scheme is described by this series 
of numbers. He has a rather elaborate system of subscripts to indicate those scales 
above a T score of 70, those of equal elevation ete., which is described at length in 
his original publication. Welsh“ has introduced a modification of this classification 
system which is somewhat easier to handle but is not pertinent to this article. 

Most studies concerned with patterns of psychometric results have grouped 
patients according to some non-test criterion and sought common patterns. Few 


*This paper is based on data used in a doctoral dissertation submitted to the Department of 
Psychology of the University of Minnesota. 
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have used the other technique of seeking the characteristics of persons who show a 
certain pattern of test results. Such research would seem desirable inasmuch as it 
constitutes a duplication of the experience of a clinician who, when confronted with 
a test profile, must rely on his experience to develop an assessment of the client based 
upon the pattern of that particular profile. 

Following the latter procedure Hathaway and Meehl have published an atlas ® 
of short case histories and MMPI data on 968 patients. The order of presentation 
is arranged so that patients are presented consecutively according to their profile 
type. They did not offer summaries of the outstanding features of patients with 
similar profiles because they felt that, ‘‘we might have missed many points, empha- 
sized wrong ones, and in general communicated a mixture of truth with our own 
possibly false summaries of a kind of experience that could hardly be adequately 
organized or tabulated, let alone statistically validated”’, (6, p. xvii). While a certain 
validity must be conceded to their argument, they are asserting in effect that all that 
they can communicate to users of the test is the uninterpreted results from past 
patients. If experience with the test gives anything it should give some suggestion 
of the implications of an elevation of scale A in a configuration of elevations and de- 
pressions of scales B, C, etc. To be more specific, if one reads the 12 (Hs D) cases in 
the atlas one gathers the impression of extensive variation within the group. This 
great variation is reduced if one eliminates from consideration those with L scores 
above 60, the point at which a profile is of dubious validity. The remaining cases 
appear sufficiently homogeneous to warrant certain summarizing statements. 


PROCEDURE 
This study is concerned with the characteristics of persons giving essentially 
similar MMPI profile patterns. The data were made available by a specialist in 
internal medicine who used this test with many of his patients. He took a history 


of the patient with a careful list of current complaints and recorded his impressions 
of the patient before administering the test. Subsequent data regarding the patient 
could, of course, be contaminated by the test results. 

The profiles of 1104 patients were coded according to Hathaway’s system and 
grouped according to the profile pattern 12, 13, etc. The files of patients of each pro- 
file group, where numbers permitted, were studied in an attempt to develop a sum- 
mary of the most frequently appearing physical and psychological symptoms and 
the course of the disorders. 

After the descriptions were obtained, the answer sheets of each group were sub- 
jected to an item analysis to discover those items which differed significantly from 
Hathaway’s original standardization group and from a random sample of the total 
population of this study. The purpose of this analysis was to discover those items 
which differentiated the group in question not only from people in general but from 
other patients. It was predicted that many of the items identified would be those 
which appeared on the two identifying scales. Even so, these and others which might 
appear should serve to elucidate the characteristics of the groups concerned. 

Some discussion should be included at this point concerning the nature of this 
sample. These were private patients who chose to come to this particular physician. 
He selected for testing those whom he felt to have complications which were in part 
attributable to emotional factors. The incidence of various code types in this popu- 
lation, particularly those with a high Hy scale, is different from that of other popula- 
tions such as mental hospitals. Indeed, some of the sorts of patients encountered 
in this study are rarely seen in psychological clinics. Test results from diverse sources 
should serve to increase perspective on the various scales and consequently inter- 
pretive skill with the test. 

RESULTS 

Data were available on 365 men and 739 women, a total of 1104 patients. The 
distribution of code types is shown in Table 1. It can be seen that every type ex- 
cept S-, 81 and 83 appeared in this population although many types are so infrequent 
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TaBLe 1. DisrriputTion oF MMPI Prorite Types or 1104 Patients. Tue Hicuest Scaue Is Given 
ON THE ORDINATE, THE SECOND HIGHEST ON THE ABSCISSA. 


9 9g 


1 2 4 6 | 7 ¢ 
(Hs) (D) y (Pd) (Pa) (Pt) (Ma) | Total 


1 (Hs) 4 33 5 | 4 11 ‘ 142 
2 (D) 14 | 64 : 19 55 | 332 
3 (Hy) 40 | 112 so | 5 49 ; 381 
4 (Pd) | 5 87 
6 (Pa) 9 | 6 5 ‘ 55 
7 (Pt) 1 : = | 
8 (Se) a d 2: 
9 (Ma) ‘ 10 j 4 | 56 


*The first column is of those profiles having only one T-score above 54. Thus a profile with only 
Hs above 54 would be coded 1 


as to preclude the possibility of any study. The 1-, 2-, ete. groups are those whose 
profiles had only one scale above a T-score of 54. In the coding of these profiles Mf 
or 5 was not used since this scale has been found to have limited clinical value. There 
follow brief summaries of common features of frequently occurring types. 


12 (Hs D) and 13 (Hs Hy) Profiles. These two groups are described together because 
their differences are slight. These patients presented numerous complaints with a 
strong emphasis on abdominal distress and backaches. There was little demonstra- 
ble physical pathology. Their distress appeared to be only moderate but their his- 
tories showed a high frequency of visits over a protracted period of time with symp- 
tomatic relief usually short lived. Little could be learned concerning the emotional 
factors in the background of these patients for they concentrated upon their aches 
and pains and were relatively insightless concerning their personal adjustment. The 
impression was gained that these patients, more than any other group, had learned 
to live with and to use their complaints to such an extent that they were difficult 
patients to treat. Few showed marked changes for better or worse over periods of 
several years but they did show brief reductions of symptoms over short periods of a 
month or more. 

By way of differences the 12 group reported more feelings of anxiety and tension 
than the 13 group. The differences between the men and women within this group 
are in accord with a trend that appeared in all of the groups. Men with profiles 
similar to those of women placed less emphasis on their emotional difficulties and 
more emphasis on their physical symptoms. This is probably a function of our cul- 
ture in which it is more acceptable for women to admit their worries and fears than 
it is for men. These differences between the men and women of this population be- 
come more apparent in the following 21 and 23 groups. 

The results of the item analyses show that these patients obtain their elevated 
scores primarily from the enumeration of their symptoms. They picture themselves 
in terms of their symptoms and do not admit with significant frequency any items 
which relate to emotional problems. 


21 (D Hs) and 23 (D Hy) Profiles. The male patients of this group showed one or the 
other of two symptom pictures. They complained of marked epigastric distress or 
they complained of tension and depression. Nothing was found in their profiles to 
differentiate these two groups. In contrast to the preceding group there was a more 
marked concentration of symptoms in the upper gastro-intestinal tract with few 
who presented multiple aches and pains. These patients resembled the ulcer syn- 
drome described by Alexander“; they were competitive and industrious but im- 
mature and dependent. Though they dreaded increased responsibilities they sought 
promotions in their jobs. In spite of their difficulties they had maintained their 








144 GEORGE M. GUTHRIE 


normal level of efficiency. Very few of these patients had demonstrable physical 
pathology. When no pathology was demonstrable about one half of them did not 
return for further treatment. There were no data to explain whether they responded 
favorably to reassurance or went to other physicians who would not emphasize the 
emotional etiology of their disorders. 

The females of this group gave a much more varied set of physical complaints than 
the men with a slight emphasis on epigastric distress. Their complaints centered pre- 
dominantly about their feelings of tension and depression. They emphasized a loss 
of initiative, dysphoria and occasional attacks of dizziness and fear. The interview 
materials indicated that these patients accepted a chronic level of maladjustment at 
which others would have been strongly motivated to seek help. Consequently they 
showed a poor response to treatment due to their lack of motivation to improve. 

The item analysis produced items which told of a loss of self-confidence, lack of 
efficiency and brooding. The remainder of the items suggest a subtle rigidity of out- 
look and sensitiveness, an inability to feel comfortable with other people. The 21’s 
showed items which told of poor health and of aches and pains. No items of this sort 
appeared in the 23 results. 


27 (D Pt) Profiles. This was the most homogeneous group found in this study. These 
patients did not stress their physical complaints. One half of these patients were 
seriously depressed. With the rest there were strong symptoms of depression war- 
ranting such diagnostic terms as effort syndrome, fatigue and exhaustion. Psy- 
chasthenic trends of rigidity and excessive worry were frequent among the men. The 
item analysis further clarified the picture of this group, yielding items which des- 
cribed them as unhappy with themselves and uncomfortable with others. They 
complained of a marked loss of efficiency and initiative, brooding and loss of self- 
confidence. Although they are visiting a physician they do not present physical 
complaints. Contrary to expectation, these depressed patients did not respond well 


to treatment. Nor was there evidence of “‘spontaneous recoveries” as is so frequent 
with mild depressive disorders. For the most part their condition remained stable 
over protracted periods of time. 


31 (Hy Hs) Profiles. As might be expected from the discussion of the significance of 
slight differences of highly correlated scales, there are few outstanding differences 
between the patients having 31 patterns and those having 13 patterns. The com- 
plaints of these patients were of the sort that arise secondary to protracted periods 
of mild tension. They include such as headaches, backaches, pains in the chest and 
abdominal distress. Conversion hysteria was exceedingly rare in the entire popula- 
tion on which this study was based so that it is not surprising that there was only 
one instance of it in this group. Few of these patients were incapacitated by their 
symptoms. They appeared on interview to have a lengthy history of insecurity and 
immaturity and a tendency to develop symptoms when stresses increased. In con- 
trast to those whose highest scale was D, these patients placed little stress on the 
discomforts of their current emotional state and in contrast to those whose highest 
scale was Hs, they showed greater specificity of physical complaint and presented 
disorders which were of a somewhat more episodic nature. They differed significantly 
on items which told of aches and pains and on subtle items suggestive of an hysterical 
lack of self-criticism. 
82 (Hy D) Profiles. This group proved difficult to study because fewer than half of 
them returned for further treatment following the administration of the test. There 
Wis no way to discover whether reassurance had alleviated their wide variety of mild 
complaints or whether they changed physicians to avoid discussing their personal 
adjustment. Those who continued contacts showed a history of changing symptoms 
of neither increasing nor decreasing severity. Although the D scale was elevated 
there was little evidence of depressive trends. 

The item analysis produced a smal] number of items telling of concern for their 
health. The remaining items were of the Hy subtle variety which give a picture of 
an individual who verbalizes strict conformity to approved behavior patterns. They 








COMMON CHARACTERISTICS ASSOCIATED WITH MMPI PROFILES 145 


deny unacceptable impulses and ideas and any feeling of social inadequacy. These 
data together with those obtained from interviews suggest an insightless, non-intro- 
spective person who is very resistant to psychotherapy. 


34 (Hy Pd) and 36 (Hy Pa) Patterns. There is a marked predominance of women 
showing these profile patterns, outnumbering the men more than four to one. Their 
presenting physical complaints were numerous. None were of acute onset or in- 
capacitating. On interview they concentrated on their symptoms and minimized 
emotional factors. A check of the items on the test revealed that much of their ele- 
vated profile came from the so-called zero items. These are items which deny short- 
comings present in everyone. They are not so naive as the L scale items but rather, 
on face validity considerations, suggest a superficial outlook on life with an inability 
to recognize the shortcomings of either themselves or their friends. It is the cumula- 
tion of these items that reveals the patient’s overly perfect attitudes toward himself 
and others. With such a personality picture, little was achieved in treatment. 

It should be pointed out that none of these patients showed either frank asocial 
psychopathic or paranoid features. However, the interpersonal relations of this 
group were tenuous and many expressed a well-rationalized hostility toward mem- 
bers of their immediate family. 


VALIDATION OF THESE RESULTS 


It is important in such a study as this to question the validity of the results. 
Another internist made available the profiles and histories of 66 of his patients. These 
histories were studied and four predictions were made of the profile type. An esti- 
mate of the best one could do by chance was made using the four code types with 
the highest frequency in the first group studied. There were 23, 21, 12, 13 for the 
men and 23, 31, 32 and 36 for the women. A profile was considered correctly identi- 
fied if one of the four estimates was correct. The results showed forty estimates cor- 
rect which is an improvement over chance of twenty-seven correct, significant at the 
one percent Jevel. This is an optimal] estimate for it is a test of what the author has 
learned about MMPI profiles rather than what he has succeeded in communicating. 


SUMMARY 


The MMPI profiles of 1104 patients visiting a specialist in internal medicine 
were coded according to Hathaway’s system and grouped according to the two 
highest scales. 

The histories of each group were studied to discover common characteristics. 
Results are presented for those having profile patterns of 12, 13, 21, 23, 27, 31, 
32, 34, and 36. 

A validation study with a different population gave results supporting the valid- 
ity of these descriptions. 


These results were obtained on the population described. They are probably 
not completely applicable to populations which are markedly different. 
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AN INVESTIGATION OF THE RELATIONSHIP BETWEEN 
INTELLIGENCE AND THE STATISTICAL RELIABILITY OF THE 
MINNESOTA MULTIPHASIC PERSONALITY INVENTORY* 


DON L. WINFIELD, PH.D.** 


Kennedy Hospital, Veterans Administration 


INTRODUCTION 

Personality tests are widely used and certain conclusions and assumptions are 
frequently drawn from these tests without any experimental justification for them. 
In spite of their widespread use, there have been relatively few attempts to establish 
experimentally various factors which may have a direct effect upon the reliability 
of these instruments. The general procedure in the construction and development of 
a personality test is to establish norms based upon so-called normal groups. Usually 
by use of split-half techniques, reliability coefficients are computed. Subsequently, 
these tests are administered not only to groups comparable to the norm group but 
also to other groups of subjects such as Negroes (not usually included in the norm 
groups), persons of very superior intelligence, persons of very low intelligence, 
neurotics, psychotics, ete., without any knowledge as to whether the norms are ap- 
plicable or whether the reliability would be different for these groups than for the 
norm groups. The present study was designed to investigate the relationship be- 
tween the intellectual level of functioning as measured by the Wechsler-Bellevue 
Adult Intelligence Scale 1 and the reliability of the Minnesota Multiphasic Per- 
sonality Inventory ®?, The hypothesis under test is that there is a direct relation- 
ship between the intellectual level of functioning and the odd-even reliability of this 
personality test. 

METHOD AND PROCEDURE 

In this investigation the MMPI records of 175 white male veterans with Wechs- 
ler-Bellevue 1Q’s were collected from the files of the Veterans Administration Mental 
Hygiene Clinic in Philadelphia, Pennsylvania. These veterans were all referred to 
the clinie for the purpose of receiving treatment for neuropsychiatric disabilities. 
Of these cases some had been given the booklet form of the test inventory and some 
had been given the box form. Wiener? has previously demonstrated that the ob- 
tained scores are not materially affected by the form used. 

The MMPI records were first divided into four groups according to the full 
scale Wechsler 1Q. These groups were as follows: IQ 91-100; LQ 101-110; IQ 111-119; 
and IQ 120-137. Odd-even reliability coefficients for each group were computed for 
the following scales: F, Hypochondriasis (Hs), Depression (D), Hysteria (Hy), 
Psychopathic Deviate (Pd), Masculinity-Femininity (Mf), Paranoia (Pa), Psy- 
chasthenia (Pt), Schizophrenia (Sc), and Hypomania (Ma). The correlations were 
then corrected by the Spearman-Brown Prophecy Formula“. The correlations for 
the lowest and highest IQ groups were converted into Fisher’s z scores and tested 
for a significant difference between the derived correlations for each scale as against 
the correlation for the same seale in the other group. A composite T score was then 
computed for each intelligence group. The T scores of all the groups were averaged 
for each scale in order to get a composite T score for the 175 veterans. 

The MMPI records were then divided into two groups according to a second IQ 
criterion. One group contained those records of veterans whose Verbal IQ’s were at 
least five points higher than their Performance 1Q’s, and the other group contained 


*Reviewed in the Veterans Administration and published with the approval of the Chief Medical 
Director. The statements and conclusions published by the author are a result of his own study 
and do not necessarily reflect the opinion or policy of the Veterans Administration. 

**Clinical Psychology Section, Neuropsychiatric Service, Veterans Administration Medical Teach- 
ing Group, Kennedy Hospital, Memphis 15, Tennessee. The writer is indebted to Dr. Miles 
Murphy, University of Pennsylvania, for the assistance rendered during the investigation. 
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the records of those veterans whose Performance IQ’s were at least five points higher 
than their Verbal IQ’s. Odd-even reliability coefficients were computed, corrected 
by the Spearman Brown Prophecy Formula, converted into Fisher’s z scores, and 
tested to see if there were significant differences between the two groups on each 
scale. 

The final breakdown of the records was based upon Verbal IQ’s alone. One 
group contained those records of veterans with Verbal 1Q’s of 120 and above. The 
other group contained the records of veterans with Verbal IQ’s of 95 and below. The 
results were treated statistically as with the previous two groups. 


RESULTS 


The corrected reliability coefficients of the MMPI scales based upon total 1Q 
scores of the first four intelligence groups appear in Table 1. The most striking 
finding is the consistently high reliability coefficients on the Pt and Se scales. In 
general the 120-137 1Q group demonstrates a higher degree of reliability than the 
91-100 IQ group although only on the F and the Se scales were these differences 
statistically significant. By contrast the Hy scale of the 120-137 IQ group is sig- 
nificantly lower than the 91-100 IQ group. 


‘TaBLE 1. RELIABILITY COEFFICIENTS OF THE MMPI Scaies accorDING TO THE IQ CLAssIFICATION. 


a F s ¥ Vv See. see oe mee 
120-137 42 .92 0 <a .80 .80 .80 91 .96 ‘ 
111-119 42 .73 aE 75 .84 .50 86 91 .92 61 
101-110 38 .79 84 . 74 46 .79 .89 91 .63 
91-100 53 81 .92 Te -6 .63 .60 66 .89 91 .60 








The composite T scores for each of the four groups and the group as a whole 
appear in Table 2. Since the majority of the subjects used in this investigation were 
veterans with diagnoses of psychoneurosis, it is not surprising that the profile as a 
whole is somewhat elevated and particularly the so-called neurotic triad (Hs, D, Hy). 
It is also interesting to note the elevated F score which apparently is characteristic 
of neuropsychiatric patients. 


TaBLe 2. Composite T Scores oF THE MMPI Scams AccorDING TO THE IQ CLASSIFICATION. 








L F s Hy Pd Mf Pa Pt Se Ma 


120-137 “42 53 j j 69 70 ) 59 67 68 59 
111-119 42. 51 3 80 65 i 62 71 70 61 
101-110 -38 5: 52 j2 69 65 5 62 68 68 59 
91-100 -393 56 57 ) 3 69 ji ‘ 59 70 68 57 





Group Av. 5: 0 ; 72 j é 61 69 69 59 





A comparison of the reliability coefficients of the veterans with Verbal IQ’s 
higher than Performance IQ’s as against veterans with Performance IQ’s higher than 
Verbal IQ’s yields similar results to those reported in Table 1. The predominately 
verbal group has significantly higher reliability coefficients on the Hy and Mf scales. 

The superior intellectual group (Verbal IQ 120-137) demonstrates higher re- 
liability in general than the lowest intellectual group (Verbal IQ 76-95) as indicated 
jn Table 3. Only the Se scale is significantly higher. The failure to get more sig- 

TaBLe 3. RELIABILITY ( 





‘OEFFICIENTS OF THE HIGHEST AND LowEst VERBAL IQ Groups 








coe : me 8 D Hy Pd Mf Pa Pt Se Ma 
120-137 3! A . 86 72 55 81 .79 .73 91 .96 71 





77-95 37 ; 8 .76 .67 .74 64 AT 


87 .89 .67 
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nificant differences may be due in part to the relatively small number of cases. Any 
future research might be directed toward the setting up of cut off points at the lower 
intellectual levels below which the scores are not sufficiently reliable for clinical use. 


SUMMARY 


The purpose of this study was to investigate the effect of intelligence upon the 
reliability of the Minnesota Multiphasic Personality Inventory. Odd-even reliabil- 
ity coefficients were computed, based upon MMPI records of 175 white male veterans 
with neuropsychiatric disabilities. Groupings based upon various combinations of 
Wechsler Scale 1Q’s were treated statistically. These statistical results fail to sub- 
stantiate our hypothesis that there is a direct relationship between the intellectual 
level of functioning and the odd-even reliability of the MMPI although there did 
appear to be a general trend for the group with higher IQ’s to give more reliable 
scores. The Pt and Se scales are the most reliable scales regardless of the intellectual 
level of functioning. Composite T scores for the various intellectual groups are pre- 
sented as well as a composite score for all the subjects used in this investigation. 
The most profitable research in the future might possibly be directed toward finding 
out how low a person can function intellectually and still give sufficiently reliable 
scores that can be used clinically. 
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INTRODUCTION 

A recent survey has indicated that the Thematic Apperception Test has at- 
tained a position in personality testing second only to the Rorschach Test “. Unlike 
the Rorschach, however, the analysis of the TAT by many clinicians and researchers 
has not led to disciplined codification, but rather to greater and greater diversity of 
interpretation. Generally speaking, each new problem upon which the TAT was 
focused stimulated a new analytic approach directly relevant to the particular re- 
search. 

We can roughly categorize the types of analytic approaches used by various 
workers in terms of three levels of abstraction. At the highest level is the approach 
of the clinician in his everyday use of the thematic material. From the story given, 
by the subject the clinician makes direct statements about the individual’s person- 
ality trends. The statements about personality are at a high level of conceptualiza- 
tion and far removed from the concrete material in the subject’s story. 

Recently a number of workers have proposed so-called apriori schemes which 
could be applied to a large variety of research problems and these illustrate two lesser 
levels of abstraction. In what was essentially a normative study Fron“ used a 
thematic type of analysis in which the response categories consisted of descriptive 
statements about the manifest content of the story material. This method illustrates 
the most concrete approach to the analysis of the TAT. 

Somewhere in between these extremes of the levels of abstract scoring lies the 
type of scheme originally proposed by Murray et al“ and more recently systema- 
tized by Aron“), The variables utilized in this third approach are concrete in that 
they categorize the literal thematic material of the story protocol, and also inter- 
pretive in that by definition the various categories make certain assumptions about 
the personality trends underlying the production of the various aspects of the 
protocol. 

All of these methods have both practical and scientific advantages and dis- 
advantages. They vary in the time required for their application, in the difficulties 
in obtaining good reliability of scoring, and possibly most important in the signi- 
ficance of the material they reveal about individual personality functioning. In this 
paper we propose to demonstrate what we feel is a practical and scientific design for 
indirectly validating a scoring scheme similar in structure to that of Aron’s. The 
approach described in this design has been applied to other forms of TAT analysis. 


THE DESIGN 


We propose to select a score on the TAT which differentiates operationally two 
groups from a heterogeneous total population. We will then attempt to establish 
tentatively the psychological significance of this score by hypothesizing for the com- 
parative functioning of our two groups on certain external behavioral criteria. If 
our hypotheses are successful we will then use our independent variable, the criterion 
score, to establish a framework within which to make hypotheses about the function- 
ing of our groups on other, more complex, TAT scores. These hypotheses will then 
be checked against the actual scores of our two groups. 


*Reviewed in the Veterans Administration and published with the approval of the Chief Medical 
Director. The statements and conclusions published by the author are the results of his own study and 
do not necessarily reflect the opinion or policy of the Veterans Administration. B 

This paper is based on a dissertation completed by the author as partial fulfillment of the require- 
ments for the Ph.D. degree at Western Reserve University. The data were collected under the auspices 
of the Cleveland Mental Hygiene Unit of the Veterans Administration. 
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Subjects. Forty white male neurotic and borderline psychotic out-patient vet- 
erans between the ages of twenty and forty, in treatment at the Cleveland VA 
Mental Hygiene Clinic, were selected for testing. The names of these patients were 
suggested by the various therapists, and they were contacted for the expressed pur- 
pose of a routine evaluation. 


Method. A modified form of the TAT! was administered to each patient by the 
author. Two groups consisting of 21 and 19 cases were chosen from the total popula- 
tion in the following manner. 

Each protocol was scored and summarized in terms of two selected sets of 
scores. The first group of scores was composed of the Murray-Sanford ‘need and 
press’ “+” and various other scores which are included in the Manual originated by 
Dr. W. J. Cohen and the present author “>. The sum of the individual need and press 
scores was labeled ‘R’. Instructions to the test require the subject to make up a 
story about the pictures. The need and press scores reflect directly the story told by 
the subject and in a sense serve as a measure of the subject’s adaptation to the test- 
ing situation. We assumed, therefore, that ‘R’, the sum of need and press, represent- 
ed just such a measure of adaptation. . 

The second set of scores was chosen to represent those aspects of the protocol 
which did not contribute to the story quality of the subject’s response.2 The sum 
of these scores was labeled ‘S’. These scores have been taken in the literature and in 
clinical experience as indicators of the presence of anxiety and tension in the function- 
ing of the individual ©. 

It was felt that the relationship between both sets of scores would yield a 
measure of the degree to which anxiety was blocking the functioning of our patients. 
If an individual had.more ‘S’ than ‘R’ it would be an indication that anxiety and 
tension were blocking more effective functioning. If, however, an individual had 
more ‘R’ than ‘8’ it would seem that in spite of the presence of some anxiety he was 
still functioning at a personally adequate level. 

The ratio between ‘S’ and ‘R’ was taken for each individual. A ratio score of 
1.00 and above indicated that the subject had as many or more ‘S’ than ‘R’. A ratio 
of .99 and below indicated that the subject had more ‘R’ than ‘S’. The group of 
forty cases was divided on this basis. A ratio of 1.00 and above placed the subject 
in the high group and a ratio of .99 and below placed the subject in the low group. 
Nineteen of the cases were placed in the high group and 21 in the low group. The 
factors of age, level of education, intelligence rating and psychiatric diagnosis did 
not differentiate significantly between the individuals in each group. We now had 
two groups who differed from each other on scores representing one general aspect 
of their functioning on the TAT. 


TABLE 1. DisTRIBUTION OF THE Ratios or ‘S’ To ‘R’ IN a Grovp or 40 Sussects 








Low Group High Group 
Interval Frequency Interval Frequency 


1 .00-1.19 
. 20-. 36 4 .20-1.39 

1 .40-1.59 
7 .60-1.79 
8 


.O1-.19 





sas ‘Card 1, 2, 3BM, 4, 6BM, 7BM, 12M, 13MF, 14, 15, 18BM, and 20 of the 1943 edition were 
mciuded, 


*These factors included ‘pauses’, ‘self-reference’, ‘stimulus reference’, ‘object description’, ‘state- 
ments of probability and possibility’, and ‘alternative plots’. 
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In order to predict comparative scores for the two groups on the more discrete 
scores we had to develop a framework which would guide the establishment of hy- 
potheses. As mentioned above, the author felt that the index obtained in the ratio 
score if representative of a significant personality variable, would be most closely 
related to a measure of anxiety or tension. It was assumed that the group which 
had more ‘S’ than ‘ R’ was being blocked by anxiety or tension to the point at which 
their productivity was being inhibited, whereas the group who maintained a pre- 
dominance of ‘R’ scores was handling its anxiety in such a manner as to be, in terms 
of their personal level of functioning, relatively more productive. 


Validating Criteria. Two external (to the test itself) criteria were chosen which, 
if successful in differentiating between our groups, would make us more confident 
in establishing this framework for predicting more particularized differences in the 
test scores. The first criterion chosen was a rating of overt anxiety expressed by the 
subject in interviews with both social worker and psychiatrist. These workers make 
routine statements about the degree and type of anxiety that the subject expressed 
in their interview with him and it was felt that these statements could be rated re- 
liably and the groups compared on the basis of the ratings. If the L group (the group 
with a ratio of .99 or less) would have a significantly lower ‘anxiety rating’ than the 
H group (the group with a ratio of 1.00 and above) then we would have some basis 
for assuming that the groups actually differed, at least at one level of expression, in 
the amount of anxiety that they manifested. 

The statements made by the social worker and psychiatrist were rated on a 
three point scale. A statement such as “‘The patient appeared quite calm and re- 
laxed”’ was rated ‘0’. A statement such as “The patient was somewhat tense but ap- 
peared fairly well controlled” was given a rating of ‘1’. A statement such as “The 
patient was quite tense and anxious” was rated ‘2’. Both reports were read for each 
subject and it was found that statements of this type were clear and concise and that 
in no case did the rating based on the social worker’s report differ in any respect from 
the rating based on the psychiatrists’s report. The results of the group comparisons 
on the ratings are presented in Table 2. 


TABLE 2. RatTInNGs or ANXIETY AND CHANGEs IN SYMPTOMsS* 








| 
Cases at 
and above 








Variable 


Anxiety Rating 


Symptom Change 


Cutting Point 





Chi- 
Square 








6.42 
12.13 





Criterion 











*Ratings 
0 


No anxiety 
1 Controlled anxiety 
2 Uncontrolled anxiety 


**P/2 Since we are estimating the probability that group H will exceed group L 


The second criterion used was a count of the absolute number and different 
types of symptoms reported by the subjects in the first intake interview with the 
social worker compared to the number and type reported to the psychiatrist in the 
second intake interview. The hypothesis stated that the H group should have a 
greater number of changes than the L group. The rationale for the hypothesis was 
based on the assumption that a well established set of symptoms binds anxiety and 
was related to Freud’s suggestion of the inverse relationship between symptoms and 


3It would appear that observations as to the person’s overt expressions of anxiety have a high 
degree of reliability. 
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anxiety“. It was felt that if the H group were actually more anxious they should 
manifest their difficulties in a lack of adjustment to a specific pattern of symptom- 
atology. 

The number and type of symptoms reported to the social worker were listed 
and compared to the number and type reported to the psychiatrist in the subsequent 
interview. The results of the comparison are also presented in Table 2. 








TABLE 3. ComPaARIsON OF RaTINGs BY DIFFERENT INTERVIEWERS 
Variable Frequency Chi-Square 


| 
ae ie Se 


Social Worker I 9 
Social Worker II 

Psychiatrist I 

Psychiatrist II 


Psychiatrist III ¢ ) .66 5 





In order to be relatively more certain that these differences were not due to 
other measureable factors which we could reasonably expect to have some direct 
influence on these results, two items had to be controlled. It was felt that the specific 
interviewer might in some way elicit systematically a greater number of symptoms 
from the subject. -We had, therefore, to make certain that neither group was seen 
by any specific interviewer more frequently than by any other. The time interval 
between interviews also had to be controlled since we could normally expect a greater 
number of changes with an increase in the time interval. Tables 3 and 4 present the 
results of this check. It was clear that the hypotheses were verified at a good level 
of probability and that the factors which we checked did not contribute towards 
these differences. It was, therefore, safe to conclude that a measure based on a sub- 
ject’s response to the TAT permitted us to predict successfully for comparative 
measures on our cases On criteria external to the test itself. 


Tasie 4. Time Lapse BETWEEN REPORTING OF SYMPTOMS 








Cases at 

and above’ | Chi- 
Variable | Cutting Point ————_| Square 
| & H 


Time Lapse (days) | | 10 





RESULTS 

Prediction of Manual Scores. We found that we could successfully predict the 
behavior of our two groups on external criteria when we knew the differences be- 
tween them on certain TAT scores. In spite of our successful prediction of the two 
behavioral criteria we could not maintain that our subjects did differ in degree of 
anxiety since we felt that the measures used were too narrow in scope for valid 
selection. The variable was chosen simply as a matter of convenience in that we 
needed some conceptual framework within which to define more discrete hypotheses.* 


‘In terms of the general point made in this paper it would not even have been necessary to check 
some external criteria of anxiety for we could just assume that differences in the operationally defined 
measure of anxiety on the TAT may actually represent some real difference in the personality structure 
of the two groups. 





AN INDIRECT VALIDATION OF A THEMATIC APPERCEPTION TEST 153 


With some evidence that our groups differed in degree of anxiety, we then pro- 
ceeded to formulate predictions as to how our two groups would perform on other 
aspects of the thematic productions. Based on psychoanalytic theory we estab- 
lished for ourselves a picture of the personality of the anxious character as compared 
to one relatively less anxious. Keeping in mind some preconceived notions as to the 
psychological significance of our scores of ‘character’, ‘need and press’, ‘level of 
action’, etc., we then formulated 57 hypotheses against which to check the relative 
performance of the two groups. These hypotheses were classified under such formal 
and content categories as “ Differentiation of the Self and of the Environment”’, 
“Empathic Capacity”, “Integration and Articulation’’, “Depressive Tones”, “Guilt 
_and Moral Values’’, ‘ Aggression”’, ‘‘ Punishment and Lack of Love’’, etc. The Chi- 
square test of significance of difference was used and the .05 level of probability was 
established as the critical point of success or failure of prediction. 


Tas_e 5. RELIABILITY OF REsULTs 





No. of Chi- Significant at 
Hypotheses Squares .05 pr better Percentage 


57 175 73 41% 





Checking the 57 hypotheses involved the use of 175 Chi-square tests of sig- 
nificance. We found that 41°; of the statistical tests were significant at the .05 level 
of probability or better. Utilizing the five percent level as the point at which we 
accept or reject our hypotheses implied that by chance alone we could expect that 
nine (5%) of our Chi-square tests would be significant. Our result was markedly 
above chance and we could state with some confidence that our scores did differ- 
entiate adequately between our operationally selected groups. 


CONCLUSIONS 


In promoting a new and untried scoring system the most frequent approach 
is to plunge into direct validation attempts by comparing the scores in various 
diagnostic groupings. This procedure is expensive and time consuming and cannot 
yield a satisfactory overall evaluation of the scheme. In the event of successful 
differentiation of criterion groups we can only state that the scores can discriminate 
between groups varying along one independent variable. We do not know whether 
the individual scores have any meaning in relation to each other nor whether they 
complement each other adequately in a description of personality. 

This design seems especially applicable to the empirical evaluation of new and 
unproven scoring variables. It does not necessitate high-powered selection methods 
against which to check an unvalidated apriori method. It does not demand a pre- 
mature validation research with a technique whose usefulness for a particular var- 
iable is still a matter of conjecture. Although the design does necessitate the use of 
a specific personality variable for the sake of establishing a theoretical framework, 
the choice of anxiety, which is considered quite a generalized and more or less uni- 
versal trait, was geared to this overall approach. 

We have assumed implicitly that a personality picture derived from the psycho- 
analytic theory of anxiety is essentially a consistent one. We have formulated the 
various concepts in such a manner as to be able to predict successfully the compara- 
tive behaviour of our two groups on our scoring manual. We can conclude, therefore, 
that to the degree to which the psychoanalytic theory of anxiety, as well as our con- 
ceptions of it, depicts a consistent and complementary personality picture, so does 
our scoring system to an approximation of that degree, depict a consistent and com- 
plementary personality picture. 
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We now feel justified in expending greater time and effort on more stringent 
selection procedures in establishing criterion groups against which to validate our 
scoring scheme. 

SUMMARY 
1. Various types of analytic approaches to the TAT were discussed. 
2. A method for evaluating a new scoring system for the TAT was described. 


3. A measure of internal consistency was obtained by using the psychoanalytic 
theory of anxiety as an instrument for predicting the functioning of operationally 
selected groups on our scoring scheme. 


4. Our successful prediction led us to the conclusion that our system of scoring var- 
iables did accomplish a consistent description of personality. 


5. We further concluded that the results of this study justified approaching more 
direct validational procedures for our manual. 
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STIMULUS FUNCTIONS OF THE SZONDI CARDS 
N. ELAINE DAVIS AND VICTOR C, RAIMY* 
The University of Colorado 


INTRODUCTION 


As a relative newcomer to the field of diagnostic psychological testing, the 
Szondi Test has recently received a large amount of attention. Unfortunately, the 
literature contains very few studies relating to its validity. Although the only final 
method of validating a personality measure must depend upon empirical findings, 
study of the underlying theory can serve as a means of studying the test itself and, 
indirectly, its validity. The investigation reported below failed to find support for 
what appear to be three of the basic assumptions underlying the test. Since, how- 
ever, the investigation was limited to a study of what appear to be some of the logical 
assumptions of Szondi rationale rather than variables definitely known to be oper- 
ative in and central to the Szondi Test, our results must be viewed only as raising 
serious questions about the usefulness of the test, Nonetheless, in the light of this 
and two other studies which also have given negative results“: 4), the burden of 
proof seems now to rest with the proponents of the Szondi. 

Unlike other projective techniques which permit a very wide freedom of re- 
sponse to stimulus situations, the Szondi is based upon the detection of standardized 
uniformities in response rather than upon the relative uniqueness of individual per- 
ception. (Whether the Szondi is a ‘“Projective’”’ technique is of no moment here.) 
The scoring system of the test as well as the assignment of test pictures to prear- 
ranged categories assumes that each picture has a certain stimulus value somehow 
resident in that picture. Logically, then, the following three assumptions investi- 
gated in the present study seem to be demanded by Szondi scoring and interpre- 
tation." 

1. Each individual card should have a standard stimulus value. 2. All cards in the 
same clinical category (i.e. the 8 categories such as Sadist, Homosexual, etc.) must 
have similar or identical stimulus values. This second assumption is required by the 
fact that in Szondi scoring, cards within a given category are treated interchange- 
ably, as the same score is given for any combination of an equal number of cards 
selected from that category. 3. There should be no overlap between categories in regard 
to the standard stimulus value of the category. Cards selected from a particular cate- 
gory purportedly represent specific personality characteristics, and each category is 
definitely differentiated from each of the other 7 categories. The fact that Deri ® 
and others insist that the test be interpreted on the basis of the total profile inval- 
idates none of the assumptions above, since the profile is obtained by a mechanical 
scoring which makes use of these three assumptions. 


METHOD 


If it can be shown that each card does not have a standard stimulus value, and 
that there are differing stimulus values for cards within a given category, and that 
there is overlapping between categories, the assumptions listed above cannot be sup- 
ported. Our methodology consisted in attempting to determine whether each of the 
Szondi cards has a standard stimulus value by asking groups of subjects to make 
multiple choice selections from among a series of free descriptions of the “‘expression”’ 
on each card obtained from a first group of subjects. The study was conducted in 
three stages and the data from each stage were utilized in each of the succeeding 
stages. 


*Appreciation is expressed to K. F. Muenzinger and Maurice Smith for their help and criticism 
in this research. 

‘Schafer (5) has recently and independently made a similar analysis and also lists additional 
assumptions of the test rationale. ; 
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Only 24 of the 48 pictures in the test were studied but those selected consisted 
of half of the cards from each of the eight categories. In each stage, the cards were 
projected individually onto a screen to groups of ten to fifteen subjects. Care was 
taken that the seating arrangement did not produce visual distortion. All subjects 
were volunteers from beginning classes in psychology and none was known to be 
acquainted with the Szondi Test. The steps and results of each of the three stages 
of the study are presented separately below in order to show what was done to avoid 
forcing subjects to narrow down their selection of descriptions by methodological 
bias. 


Stage 1. At least eight frequently employed descriptions were desired for each 
of the 24 cards in order to provide a wide range of choice for a second group of sub- 
jects who in (Stage II) were asked to make multiple choice selections for each card. 
Therefore, in Stage I, 39 subjects were asked to write a brief, free description of the 
“expression’’ on each card after a two minute projection period. Three judges, work- 
ing independently, collated the obtained descriptions for each card in terms of 
identity and similarity. Each judge compiled a list of the eight most frequently 
employed descriptions for each card. and, finally, in a committee session, the judges 
reached agreement upon 24 lists of eight descriptions each. As this procedure yielded 
192 descriptions, only one example can be reported here. Hysteric card II-I had the 
following list: 1. Pained, tormented, grief-stricken; 2. Sad, sorrowing; 3. Haughty, 
arrogant, disdainful; 4. Scornful; 5. Disgusted; 6. Cruel, vindictive; 7. Impassive; 
8. Sickening hatred. 

The three judges felt their task had been relatively easy and that there was very 
considerable agreement among their lists prior to final selection during the com- 
mittee session. Inspection of the lists indicated that for some cards there appeared 
to be a very narrow range of possible expressions, while other cards appeared to have 
a great range. --~ 


Stage II. It was now desired to determine whether any of the free descriptions 
for a given card would be chosen better than chance by a second group of subjects. 
Following the procedure described above under Method, 64 new subjects, before 
seeing the lists prepared by the judges, wrote preliminary free descriptions for each 
card. This was done in order to avoid their being too greatly influenced by the pre- 
pared lists. Immediately after writing their free description, the subjects then 
selected their first three choices from the prepared lists. The free descriptions from 
Stage Il were quickly discarded, since there was very high agreement between them 
and those free descriptions which were obtained during Stage I. The few “original’’ 
and ‘free’? descriptions obtained in Stage II appeared to be idiosyncratic responses 
since none was given a significant number of times in the ‘‘free’’ situation. 

The data from the first group of 16 subjects in Stage II were tabulated separ- 
ately in order to determine the consistency of results. A simple comparison based 
upon the ranks assigned to each of the 8 descriptions for each card, between the first 
group of 16 subjects and the remaining 48 subjects, revealed very high consistency 
between the two groups, since only 30 out of 192 descriptions changed rank order 
more than two ranks. The data from Stage II agreed quite closely with the data 
obtained from Stage I, since for most cards those descriptions which were most fre- 
quently given by the subjects in Stage I (‘‘free’’ descriptions) were also selected with 
very high frequencies by the subjects in Stage I]. Also, cards for which there was a 
narrow range of descriptions for Stage I also resulted in a narrow range for Stage IT. 
Conversely, other cards produced a wide scatter in both stages. 

Thus, Stage II confirmed to a large extent the results obtained for Stage I and, 
more concretely, produced three descriptions for each card out of the eight resulting 
from Stage I. Stage II was essentially a process of narrowing down the possible 
descriptions obtained from the “‘free’”’ description situation by employing an entirely 
new group of subjects. 
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Stage III-A. Inspection of the three top choices for each card resulting from Stage 
II revealed that many of the 72 descriptions (3 for each of the 24 cards) were similar 
or identical and could logically be grouped together. Again judges were resorted to 
and two staff members plus one graduate student in psychology were asked to group, 
according to identity or similarity, the 72 descriptions into the smallest number of 
different clusters which would adequately include all of the 72. A final committee 
session was used to produce a list (Table 1) which included sixteen clusters embody- 


TABLE 1. Tue SrxTEEN CLuUsTERS OF DESCRIPTIONS OBTAINED IN SraGe III-A From Turee JupGEs 








Sad, hopeless, grieved, pained 9. Amiable, gentle, kind, loving 

Self-satisfied, smug, arrogant, boastful 10. Trusting, admiring, worshipping, adoring 
Pleased, amused, contented, happy, joyous 11. Wise, intelligent, distinguished 

Stern, determined, commanding, domine ering 12. Prissy, effeminate, affected 

Mean, crafty, suspicious, sullen, revengeful 13. Angry. , accusing, disliking, scorning, hating 
Doubtful, puzzle d, confused, worried. 14. Humble, submissive, shy, timid 

Impassive, indifferent, bored, weary, apathetic 15. Emb: urrassed, shame ‘d, humiliated 
Thoughtful, attentive, interested, pondermg 16. Fearful, threatened 





ing the original 72 descriptions. The condensation of the descriptions was done 
without reference to any cards. Since there is no known way of objectively sorting 
adjectives into homogeneous groups, criticism of the sixteen clusters can certainly 
be voiced. For example, the range of “expression” within a cluster provides an 
unknown degree of looseness for any analysis based upon these clusters. None- 
theless, inspection does seem to reveal considerable uniformity within each cluster 
and a minimum of overlap between clusters. 

With the sixteen clusters, it became possible to re-analyze the data obtained in 
Stage Il by grouping each of the 72 descriptions (obtained in Stage II) under one 
of the 16 clusters. 


TaBLE 2. REGROUPED Data FROM Stace II SHowrna Ciusters SELECTED BETTER THAN CHANCE 
(.05) ror Eacn Carp 








Szondi Category Card No. Cluster No.* 


II -1 
Hysteric IV -2 
VI-6 
II -5 
Depressive IV -3 
VI-7 
II -3 
Epileptic IV -7 
VI-8 
II -2 
Manic IV -5 
VI-1 
II -4 
Homosexual IV-8 
VI-2 
II -8 
Catatonic IV -4 
VI-3 
; II -7 
Sadist IV -6 
VI-4 
Il -6 

Paranoid IV -1 11 
VI-5 6 13 


*The numbers in this column correspond to the numbered clusters in Table 1. Thus the names 
of the clusters can be found in Table 1. 
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Table 2 shows the results from Stage Il when the data were regrouped by the above 
method. Only those descriptions were regrouped which were selected significantly 
better than chance in Stage II at the 5°; level of confidence or better. The number 
of the significant clusters for each card is given in Table 2. The names of the clust- 
ers, corresponding to the numbers in Table 2, will be found in Table 1. 

In drawing up Table 2 the statistical procedure employed was as follows. The 
problem was to determine those descriptions for each card which were selected more 
frequently than would be expected by chance alone. In doing this, the expected 
frequencies and the expected standard deviations were calculated. Then the limits 
in terms of 1°,, 2;, and 5°; levels of confidence were determined. 


Stage III-B. Although the results of Stage III-A indicated that pictures in the 
same category were not likely to be described in the same fashion when subjects 
selected descriptions in a multiple choice situation, it was felt that further investi- 
gation of this point was called for. This resulted in Stage III-B which was also used 
to ascertain further whether there is a standard stimulus value for each card. 

The procedure consisted in having the writers prepare a list of eight clusters of 
descriptions (taken from Table 1) for each of the eight diagnostic categories. That is, 
for each of the three cards in the same diagnostic category, the same list of eight 
clusters was employed. Each of the lists contained the three most frequently selected 
descriptions obtained for that card during Stage II. Although the same list of eight 
clusters was employed for all pictures within a given category during Stage III-B, 
the order of the clusters within the list was markedly altered for each card within 
the category in order to avoid the slight possibility that similarities between lists 
would be detected by the subjects. These lists were then presented in our standard 
fashion to 52 new subjects who were instructed to rank their first and second choices 
for each picture. Again the obtained data were analyzed to determine whether one or 
more clusters would be selected for each card in excess of chance by using the statis- 


tical method described above under Stage III-A. Table 3 presents the results by 


TasB.e 3. CLusTERS SELECTED Berrer THAN CHANCE (.05) From Srace III-B 





Szondi Category Card No. Cluster No. 





II - 
Hysteric IV - 
VI- 
II 
Depressive IV 


one 


Epileptic 


maIw Iwo 


Manic 


7 


Homosexual 


I 
Noe, Re 


- om 


Catatonic 


w 


Sadist 


eon 


o~ 
re 


Paranoid 


oo 





STIMULUS FUNCTIONS OF THE SZONDI CARDS 159 


showing the clusters (numbered in accord with Table 1) which were selected better 
than chance. 

Comparison of Tables 2 and 3 shows there is a considerable amount of consist- 
ency in the selection of clusters for the same card, even when different groups of sub- 
jects are used. However, despite the consistency among subjects, the cards fail, for 
the most part, to yield single clusters and there is considerable overlapping between 
cards from different categories. 

Table 3 provides us with information on the likelihood that cards within a given 
category all have the same stimulus value. Simple tabulation of overlap within the 
three cards of a given category reveals that there is very little likelihood that intra- 
category similarity of stimulus value is high. Such a tabulation shows that for the 
three cards used from the Hysteria category, two descriptions are common for two 
of the cards but not for the third card. For the Depressives, only one description 1s 
given better than chance for two of the three cards. For the Epileptic cards, no 
identical description is given for even two cards. For the Manic cards, one descrip- 
tion is given for two cards, and the same description @? is also given for two of the 
Homosexual cards and one Depressive card. 

Considering the stimulus value of a particular card without regard to category, 
it appears that for three cards there is a standard stimulus value, since in Stage IU, 
only one description occurred with greater than chance probability. These three 
were Depressive card 1V-3, which was described as ‘Mean, crafty, suspicious, sullen, 
revengeful”; Manic ecard LV-5, which was described as ‘‘Self-satisfied, smug, boast- 
ful, arrogant’; and Catatonic card VI-3, described as “Pleased, amused, contented, 
happy, joyous’’. All other cards seemed to lack a standard stimulus value. 

One possible major criticism of the present study could not be investigated 
objectively. This problem is concerned with the difficulty of deciding whether the 
16 clusters of descriptions presented in Table 1 are actually independent and non- 
overlapping descriptions. If they are non-overlapping, the analysis of Stage II-B 
indicates that for 21 of the 24 cards studied, there is no single standard stimulus 
value. If it can be shown that overlap occurs among some of the 16 descriptions, 
then the conclusion is not clear-cut. For purposes of analysis, several judges were 
asked to indicate whether the two or three clusters selected better than chance for 
each of the 21 cards would be “compatible” in the same picture. The results in- 
dicated that for nine cards, at least two of the clusters selected could be considered 
“compatible”. For example, the judges agreed that the two clusters selected for 
Catatonic card II-8 “humble, submissive, shy, timid” and “embarrassed, shamed, 
humiliated”? might well be compatible in the same expression. There were, however, 
three cards for which compatibility was very uncertain and 9 cards for which the 
judges decided there was distinct incompatability in the clusters selected by the sub- 
jects. The question of compatibility bears only upon the problem of the standard 
stimulus value of particular cards and appears to have little relation to the assump- 
tion that cards within a given category should have the same stimulus value. 


DIscUSSION 


Many objections can be raised to the methods employed in reaching our present 
conclusion that most of the cards in the Szondi test do not have a standard stimulus 
value and that cards within a given category do not have the same stimulus value, as 
required by the scoring of the Szondi test. Only the most perplexing objection will 
be mentioned here since most of the other problems seem to stem from it. This 
major difficulty lies in the fact that the method of impression, as used in the present 
study, fails to provide clear-cut variables which would make it possible to determine 
the “impression” of a given card and at the same time exclude the possibility that 
other variables may be present and influential, but not detected. Although semantic 
difficulties enter the problem, they are not as troublesome to deal with as the possibil- 
ity that subliminal cues may be more influential in the subjects’ selections durin 
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the testing procedure than they are in the verbalized descriptions obtained by us 
from our laboratory subjects. If the subliminal cue hypothesis is correct, it would 
seem that evidence is needed to substantiate it. 


SUMMARY 


College student subjects were asked to describe half of the Szondi cards in order 
to determine if there are standard stimulus values for each of the cards. Later stages 
of the study attempted to ascertain which descriptions were agreed upon for each of 
the cards at better than chance frequency. The results indicated that there are 
standard stimulus values for only three of the 24 cards employed. Two or three 
descriptions seem appropriate for each of the remaining 21 cards. There is no diag- 
nostic category in the Szondi scheme for which a description, common to all cards 
within that category, was obtained. The stimulus value for cards in a given category 
is neither common to nor unique to that category, but apparently belongs to cards 
in other categories as well. 
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INTRODUCTION 


Many psychologists have made the observation that the Szondi test administra- 
tion procedure forces a subject to select likes and dislikes regardless of how he feels 
in general toward a particular set. He is forced sometimes to select a picture as a 
like because it is the “lesser of two evils.” In the hypothetical case we are talking 
about, he would by no means select any of this “disagreeable” set as likes if he were 
to pick out the twelve pictures that he liked best from among all 48 spread out on 
the table. Still in the conventional procedure he is forced to designate two of the 
pictures from this set as liked. 

This hypothetical condition implies two things: 

1. The word “liked” should be used only in context. This holds true 
for such words as “preference” and ‘‘valence”’ as well. Since Szondi 
picture selections are made with respect to the others in the set only, 
and not with regard to the other 47, we must be careful of shifting 
context and talking about ‘“‘the most prefefred”’ pictures (the context- 
ual implication being among all 48 pictures). 

2. If some sets are more disagreeable, i.e., offer fewer opportunities 
for the subjects to find a picture they really like, then the sets must be 
differently constituted with respect to preference values of the individual 
pictures. ; 


The idea of preference value can be obtained best by a consideration of a pro- 
cedure analogous to the Szondi test. This procedure known as the forced-choice 
rating technique requires the rater to consider a small group of items describing a 
worker’s characteristics.!. He is forced to select the ones which best describe the 
individual being rated and those which are least applicable. The number of these 
items so marked is predetermined by the number of the items in the group and the 
rater is forced to make the prescribed number of selections. He then proceeds to 
mark the other groups of items on the rating form in a similar fashion. The grouping 
of these descriptive items is not arbitrary but is designed to make all within a group 
appear about equal in desirability. To determine the preference values, which in- 
dicate the supposed desirabilities, the ungrouped items are presented to many judges 
who rank or otherwise rate them for desirability. The results of this procedure are 
the preference values for the individual items. It is then a simple matter to group 
the items in terms of comparable preference values. In spite of this equating for pre- 
ference values within the set, these items will have demonstrated different diagnostic 
values. That is, some will be associated with satisfactory work while others will be 
associated with poor vocational adjustment. The total work adjustment is repre- 
sented by a composite of the negative and positive diagnostic values for the items 
marked by the rater. For further description the reader is referred to Richardson’s 
paper @), 

We see that in essence the Szondi procedure may be regarded as a forced-choice 
technique. This forced-choice application to Szondi has two advantages: 1. It causes 
the subject to make a constant number of choices which will be great enough to be 
evaluated reliably. 2. An equation of general preference values within each set 
makes the test more sensitive to the individual differences to be evaluated. 


*Now at Beatty Memorial Hospital, Westville, Indiana. 

1The authors wish to acknowledge James 8. Karslake’s contribution to their understanding of the 
methodology of the Szondi test by suggesting the parallel between the forced-choice rating technique 
and the Szondi administration procedure. 
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STATEMENT OF THE PROBLEM 


No information could be obtained from the literature pertaining to the prefer- 
ence values of the Szondi pictures when the 48 pictures were presented simultaneous- 
ly. It was the aim of this study to determine such values for a restricted sample and 
apply the results to an analysis of the Szondi procedure as a forced-choice technique. 


PROCEDURE 

Kighty naive undergraduate students in an introductory course in general 
psychology volunteered as subjects for this study. Forty males and forty females 
were equally distributed between the experimental and control groups. The control 
subjects took the Szondi test in the conventional fashion, while the experimental 
subjects made their selections from all 48 pictures presented simultaneously. The 
simple subtraction of the number of times a picture was disliked from the number of 
times it was liked gives an approximate location of the central tendency from the 
preferred end of the scale. This measure will hereafter be referred to as the prefer- 
ence value of a given picture. Such a procedure assumes equal steps along the rank- 
order scale. A study by one of the authors“? confirms this to a large extent. 


RESULTS 

It was possible to group the derived preference values by Szondi sets. This 
information is given in Table 1. The ranges and medians of the preference scores in 
each set is given so that a comparison with Table 2 is facilitated. Thus Table 2 
presents an arrangement of Szondi pictures in sets which have the smallest possible 
differences between the preference values within the sets. It will be noticed that the 
optimal arrangement, as shown by Table 2, has ranges of preference values for each 
set which are approximately half the ranges seen for Szondi’s arrangement by sets. 
Since the question of whether Szondi’s sets could have been composed by a selection 
of eight pictures at a time drawn randomly from the whole population of 48 pictures 
would arise, some check was made on this. Six samples of eight pictures each were 
drawn from the 48 Szondi pictures by a random, card-shuffling technique. The 
ranges of the preference values for the sets so derived are approximately equal to 
those found for Szondi’s sets. However, while the median preference values of the 
individual pictures for these randomly drawn sample sets lie around zero, the med- 
ians for the Szondi sets were all higher than those of the random sample with the 
exception of set IT. 

It seemed desirable to go further and investigate the circumstances surrounding 
the shift of a picture in preference value when the two different administration pro- 
cedures were used. From the introduction to this paper, it can be deduced that those 
pictures will show the greatest shifts in preference value which occur in the sets that 
might commonly be described as most “agreeable” ‘or “disagreeable.’’ This liking or 
disliking of a set would be represented by sizeable median preference values for the 
set. It will be observed that Szondi sets II, V and VI have the lowest median prefer- 
ence values. The fact that 28 pictures showing the smallest of the 34 changes in pre- 
ference value under the two conditions of administration occurred among these sets 


Tas_e I. PREFERENCE VALUES FOR THE PICTURES IN SZONDI's SETS 


Picture 


m Mdn. Range 


22 —§.0 53 
a 0.0 56 
Ss 5.5 51 
s 15.0 45 
22 1.0 51 
> 2.0 34 
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Tase IT. ARRANGEMENT OF PicturEs wirH HOMOGENEOUS PREFERENCE VALUES WITHIN Eacu SET 


; Picture 
Arbitrary - 
set No. ‘ y : j Mdn. Range 


23.0 


26 
V II 
4 31 2% : 5 i 36 
II If] Vi F 
-17 —32 ) -16 —8 -18.0 24 


points to the correctness of that deduction. The chi-square for this arrangement of 
frequencies shows the obtained results to be statistically significant at the one per- 
cent level of confidence. 


DiIscussION 


Obtaining the preference values of the pictures with all 48 Szondi pictures pre- 
sented simultaneously has made it possible to study the constitution of each of the 


sets in terms of the preference values of their pictures. It is seen that the Szondi sets 
were biased samples of the whole population of pictures in this respect. While the 
range of the preference values is comparable to those obtained by a random selection, 
the central tendencies of the sets reflect a sampling bias and differ from one another 
considerably. Since the sets do not represent the whole population of pictures, it is 
incorrect to interpret the frequency with which a picture is selected as liked under 
conventional testing procedure as an indication of popularity or valence with refer- 
ence to all the other pictures in the population. Statements of this kind must be 
confined strictly to the relative values within the particular set. 

The optimal diagnostic advantages of homogeneous composition of the sets 
with respect to the preference values of the pictures within have been shown deduct- 
ively. The information obtained for this sample suggests the possibility of making a 
better grouping of pictures within each set so that differences in the preference 
values of the individual pictures are minimized. By this regrouping, the range of 
preference values for each of the sets is almost halved over those for the conventional 
Szondi grouping. This regrouping would meet the desired conditions of the forced- 
choice technique where the items within a particular set have comparable preference 
values. 

The deviant central tendencies of the Szondi sets would be described by the 
subjects as ‘‘ These pictures aren’t very attractive,” or ‘There isn’t much to choose 
from among these.’”’ The opposite sort of statement would occur for the ones that 
deviate in the more preferred direction such as, ‘‘ Now, these are much nicer; the 
others all looked like criminals.’’ Whatever the general value of the set, the subject 
is forced to select two likes and two dislikes under the conventional procedure. It 
will not be surprising to see the likes selected from a ‘‘disagreeable’’ set drop appre- 
ciably in preference value when the selections are made from all 48 pictures at once 
and the forcing aspects of the situation are less severe. He will then be free to dis- 
regard these former choices and proceed to make his 12 like selections from the more 
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,agreeable” sets. The results of this study affirm this deductive reasoning, by de- 
monstrating a high degree of relationship between the absolute amount of change 
for preference values of the pictures in a set and the central tendency for preference 
value of that set. 

Since this obtained relationship was anticipated, it seems unlikely that these 
larger changes in preference values are attributable to chance. This high degree of 
contingency also confirms the adequacy of the sampling in the drawing of the experi- 
mental and control groups. If important group differences were responsible for the 
results, there would be no reason to expect them to be of such a nature as to produce 
this observed contingency. 

In conclusion, it might be well to consider what the Szondi test should be ex- 
pected to do. With the general design of the test procedure, it seems likely that it 
will be most profitable to consider it as a forced-choice technique. As such, it will not 
indicate which pictures a subject prefers (among all 48) but will make the selection 
of pictures depend upon personality differences, since his selections within a set do 
depend upon his preference. Each other set may then be regarded as a replication 
of the first set 

The writers feel that the current Szondi test is less sensitive for measuring the 
personality of college students than it could be. The suggested sets seen in Table 2 
are recommended. It is true that the Szondi is becoming ‘‘standardized”’ with these 
distortions present through each individual Szondi worker’s clinical experience. 
This tends to minimize the importance of the criticisms in this paper, but the instru- 
ment probably cannot afford to introduce unreliability, considering the difficult 
nature of the task of evaluating personality. 


ssa SUMMARY AND CONCLUSIONS 


1. It was the aim of this study to determine the preference values of the Szondi 
pictures for a group of students and apply the results to an analysis of the Szondi 
test procedure as a forced-choice technique. 


2. Eighty naive undergraduate students provided the data on preferences 
through a ranking procedure. The preference value was defined as the number of 
times a picture was disliked, subtracted from the number of times it was liked. Half 
the subjects ranked the pictures by Szondi sets while the other half ranked all forty- 
eight pictures simultaneously. 


3. The median preference values and ranges of the pictures in Szondi’s sets 
are presented. A regrouping into sets with minimum ranges of preference values is 
proposed and the advantages of such a modification in terms of the forced-choice 
technique are discussed. 


REFERENCES 
Guertin, W. H. A comparison of the stimulus value of Szondi’s pictures with pictures of normal 
Americans. J. clin. Psychol., 1951, 7, 163-166. 
2. Ricuarpson, N. W. Forced-choice performance reports; a modern merit-rating method. Per- 
sonnel, 1949, 26, 205-212. 
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PROBLEM AND RESULTS 


According to Szondi Test theory as presented by Deri“? a positive response to 
the pictures of a certain factor indicates a ‘‘conscious or unconscious identification 
with the motivational processes as depicted by the photographs of the respective 
factor.’ Although such a statement allows no evaluation of the frequency with 
which such identification will take place at the conscious level it includes no reason 
to suppose that such a situation will be either rare or significant. It does seem to 
imply however that to some extent positive reactions to a particular vector will stem 
from a positive tone feeling regarding the faces representing that vector. 

Many persons who have administered the Szondi Test have observed that some 
subjects protest when asked to ‘‘pick the two that they like most’’ contending that 
none of the faces are really experienced as “‘liked.’’ The purpose of the present study 
was to attempt an evaluation of the extent to which such a protest reflected a gen- 
uine reaction to the test, and to what extent it might be taken as a moderate ‘‘delay- 
ing action’”’ on the part of the subject. 

The subjects for this study were forty-eight sophomore students in Introductory 
Psychology. The group was administered the group Szondi for five administrations. 
At each administration the subjects, were asked for each slide to indicate the two 
pictures they liked most, and the two they liked least, but were further asked to 
designate for each selection whether it indicated a genuine feeling regarding the face 
in question, or whether it was merely made to conform with the test requirements. 
Thus it was possible to score each subject’s responses in two ways, one a ‘‘conform- 
ing” profile, and secondly an “honest” profile, according to whether the selections 
were merely made to meet the demands for two likes and two dislikes, or whether 
they reflected a real feeling toward the slide. The results are cited in Tables 1, 2, 
and 3. 


TABLE 1. CHANGES IN SzonprI Scores 


Factor 
1D Hy 
Plus 
Conforming profile : Se 11 


Honest profile j2 . 5 


Minus 
Conforming profile 


Hfonest. profile 


Open 
Conforming profile 


Honest profile 


Ambivalent 
Conforming profile 


Honest profile 


*The authors are indebted to Dr. Florence Mateer for the initial stimulus for this investigation. 
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TABLE 2. NATURE OF CHANGES BY TRIAL 


Change in “Honest” profiles, by trials, 
contrasted to ‘Conforming’ profiles. 


Score 1 2 3 4 5 


Plus down down down down down 
44 48 51 64 61 


Minus up down up up up 
6 5 17 6 12 

Open up up up up up 
6S 79 69 s2 Sl 


Ambivalent down down down down down 
30 26 35 24 32 


TABLE 3. CHANGES BY FACTOR 


Changes per trial 


Trial number: 
9 « 


These data clearly suggest that the typical subject in this group was confronted 
with a group of eight faces, none of which appealed to him, and was literally required 
to pick the two he liked least, and the two he disliked least. Two facts are perhaps 
worth noting from the data. First, the same trend seems to hold for all of the eight 
factors. No greater feeling seems attached to the ‘‘more basic’”’ ego factors than is 
found elsewhere. Secondly, ‘familiarity breeds contempt” in this situation, in that 
fewer “genuine’’ positive responses appear in the later trials. This may be a reflection 
of hostility to the task of taking what was for these subjects an apparently mean- 
ingless test. 

From these results the authors feel it is justified to question the extent to which 
a positive response to the Szondi faces may be interpreted as a “conscious identifica- 
tion with the motivational process depicted.”” While recognizing that even in making 
the choice of lesser evil our subjects were utilizing a potentially meaningful selective 
process, it is difficult to attach the term ‘‘conscious identification” to such a grudging 
acceptance. 
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SUMMARY 


This study sought to determine whether the responses to the Szondi Test re- 
flected a genuine attitude on the part of the subject, or were merely a result of con- 
forming to test instructions. It is concluded that, whereas the negative choices are 
relatively genuine, the positive responses reflect conforming behavior. From this it is 
suggested that a subject’s delay in making Szondi choices may stem from a genuine 
difficulty in finding positive selections, and that such positive responses cannot be 
considered to reflect a ‘conscious identification.” 
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INTRODUCTION 


Although relatively rare at present, Anorexia Nervosa has been noted in med- 
ical literature since the seventeenth century, particularly for its syndromal unity. 
The nature and history of this syndrome recently has been reviewed extensively by 
Nemiah® who credits Gull with introducing the term in 1874. Over the years, 
interest in this disorder has been kept alive by others, a total of 226 cases having 
been reported in the literature since Gull’s classic paper. From these Nemiah sum- 
marizes: ‘‘ The typical history of the ideal patient with Anorexia Nervosa: a young 
girl usually in her teens stops eating; as the weeks go by she begins to lose weight, 
her periods cease, and the patient, quite unperturbed by her changing habitus, will 
not heed the anxious urging of her family to eat. Indeed, she lies, hides food or 
secretly disposes of it to avoid their vigilance, and, despite her rapidly increasing 
cachexia, indulges in all manner of vigorous activity startling in the face of her al- 
most morbid appearance. At length, the threats and prayers of her family gone 
unheeded, she falls prey to an infection and succumbs, the victim of her perverted 
ego’... Anorexia should be considered then as not just a loss of appetite but rather 
as a disturbance in eating, ranging from a true anorexia at one extreme to bulemia 
at the other, frequently associated with ...a wealth of other gastro-intestinal symp- 
tomology.” ae 

The several clinical hypotheses which have been offered as to the underlying 
personality dynamics of this syndrome center about the symbolic nature of over- 
eating, and the ensuing guilt—a morbid counter-reaction to oral sexuality. A more 
specific hypothesis suggests that the avoidance of food symbolizes a rejection of 
pregnancy, the act of eating being equated with intercourse by the patient. Such 
hypotheses have been based mostly on the psychiatric anamneses. Systematic 
studies have not been reported supplementing these case histories with psychological 
test data. 

The purpose of the present study was to investigate by intensive clinical study 
three aspects of the behavior of patients with this disorder, which aspects have not 
been previously reported in the literature. These are (1) intellectual functjoning, 
(2) character structure, and (3) underlying emotional functioning as seen in fantasy. 
Intensive clinical studies were carried out on five patients who met the description 
of anorexia cited above. The psychological instruments utilized were the Wechsler- 
Bellevue Adult Intelligence Scale Form I, the Rorschach Technique and the Them- 
atic Apperception Test. 


Cass History Data 


The medical records confirmed the uniformity of this syndrome in six major 
life areas: 


Age and sex. All were single females, ages 15 to 17. 


Onset and progress of illness. Suddenly dieting drastically, the patients de- 
veloped a complete lack of appetite and distaste for food which remained after 
dieting ceased, and they rapidly lost from 30 to 60 pounds in weight, all in less than 
eight months. As this progressed, they would periodically gorge themselves, parti- 
cularly on sweets, only to vomit immediately. None complained of weakness but 
actually became hyperactive. Complete amenorrhea, a sheet-like pallor and pain 
on urination were the usual complaints which brought these patients to the family 
physician. Hospitalization followed the physicians’ inability to handle the problem 


« 
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medically. Several of the physicians had recognized the presence of psychological 
problems in the patient’s family. 


Development history. Previous loss of appetite or weight was denied. While 
there was little record of early infant feeding habits, family concern with dining 
habits, food dislikes and eating between meals were noted. Data about infant toilet 
training also were lacking, but periods of constipation were frequent; one mother 
daily asked about the number and quality of the patient’s bowel movements even 
though she was over fifteen. In other respects, physical development was normal 
through puberty. All patients came from families with obese women; three were 
overweight themselves, the other two having a fear of obesity. All were tall for their 
age group. There was no other history of illness or operations or even familial illness, 
except the usual childhood diseases. 


Family constitution. The mother was depicted as ‘‘dominant, driving, over- 
protective, rigid in her standards, used nagging as the main method of discipline.” 
In three cases there was a running battle between the mother and patient, centering 
around the social and heterosexual standards of the patient’s peers which the mother 
disapproved. This conflict coincided with the onsét and progress of the illness. The 
father, in contrast, was characterized as “fuddy-duddy, silent, hard-working, seldom 
with his children.” All five patients were born when the parents were over thirty. 
Though the sibling pattern differed, the patients’ only admission of familial discord 
was with a sibling (except in one case of an only child). The sibling had achieved 
without unusual effort what the patient unsuccessfully strove after, or cheerfully 
endured daily experiences which frustrated the patient. While the patient was 
plainly the favored child, she completely denied this. 

Social background and attitudes. From a lower:middle class group, residing in 
rural communities, and belonging-to an evangelical type of religious group, the 
patients were all the most devout members of their families. In all respects they 
were Victorian prototypes of strict moral standards. Social isolates, considered 
eccentric by their peers, they were over-achievers at school, with a higher spurt of 
scholastic endeavor upon the onset of their illness. In many respects they were 
“little ladies,” embroidering during their stay on the ward, constantly primping, 
drawing colored flowers, etc. They decried athletics or any show of physical prowess. 
Yet none contemplated marriage before thirty—if at all—and then did not want 
children. In many ways they admitted their envy of males and identified with 
masculine interests in their ‘career ambitions, e.g., to be “doctors,” “‘to go into poli- 
tics” or even to become a “‘ civil engineer.” 


Treatment. Brief psychotherapy attempted in all five cases differed markedly 
in outcome. In the case with the least severe symptoms, exploration of her emo- 
tional attitudes and their connection with her illness was completely resisted and no 
change was noted in the patient’s condition after 90 days of hospitalization. In 
two other cases the girls were finally able to discuss some of their behavior and feel- 
ings, but achieved little genuine insight; they were discharged only moderately im- 
proved with approximately the same number and type of interviews. In the remain- 
ing two, extensive alteration of emotional attitude was achieved and symptoms dis- 
appeared altogether. These two patients continued with weekly outpatient treat- 
ment for over six months. Their therapists considered that in both cases the patients 
had achieved enough maturity to face the continuing problems of adolescence. 


PsYCHOLOGICAL DATA 


Intellectual functioning. The results of the Wechsler-Bellevue scales, presented 
in Table 1, in general show little resemblance among the five cases, in terms of total 
scores or sub-scale patterns. Three specific points of similarity may be noted, how- 
ever: 1. The verbal weighted score is greater than the performance score (in four 
cases); 2. The comprehension score is above the average weighted score; and 3. Sim- 
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ilarities score is above the average. This pattern of higher verbal functioning, high- 
lighted by the tests of conceptual judgment and abstract reasoning, may be con- 
sidered as indicating an over emphasis on “‘intellectualizing” of reality, occasionally 
to the detriment of practical or manipulatory judgment as represented in Picture 
Arrangement. 

Personality structure. As with the Wechsler-Bellevue data, the results of the 
Rorschach, shown in psychogram profiles in Table 2, also were dissimilar. Neither 
global inspection nor specific comparison of the usual scoring categories revealed any 
points of communality. In over-all character structure, then, it must be concluded 
that these are five different adolescents. 

However, despite the lack of similarity among the major Rorschach variables, 
all five of the records contain many partial confabulations. In each such response, 
the subject first seemed to note some detail of the blot which conformed to her 
associative set, but was unable to disassociate this detail from some other simultan- 
cous or previous response. Instead, she tried desperately to connect these disparate 
details. The result was a forced, intellectualized perception of the situation— 
similar to that type of thought process inferred from the score patterns of the more 
structured tasks of the: Wechsler-Bellevue. 


Personality dynamics. The TAT stories were analyzed as to theme, identifica- 
tion with heroes and predominate sources of emotional conflict. Again, each subject 
appeared as a distinctly different individual. Three stories to Picture 6GF may serve 
to illustrate the disparity among these patients on this test: 

M. L. Looks like the man has terrified the girl. He’s going to kill her. 
(Who?) Maybe her father .. . (Why?) Maybe she’s found out something about 
him. 

D. K. Martha is returning to the sofa in the living room, as she has finished 
beautifying herself for the events that are to take place at the big church social. 
Sam, her husband, enters asking whether or not she is ready to go. After re- 
ceiving her affirmative answer, the two get their coats, lock the house and board 
the trolley car. The evening at church is very pleasant ... Martha and Sam re- 
turn, feeling great inner warmth between themselves and a mounting enthus- 
iasm to assist in spreading the gospel of good news to all. 
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F. H. Could that be a pie-anno? Let’s call her Fairybelle. A very sweet 
lady is sitting at the pie-anno. Her husband has been away for days. He’s went 
up for an examination from the Army. Although he should have been back, 
she’s confident he will return. As she’s sitting, she hears a rap at the door. She 
calls, ‘Come in,” but keeps right on playing at her pie-anno. In walks an Army 
officer and tells her that her husband has been killed. Her eyes turn tender. She 
looks surprised but cannot speak. What will she do? ... She goes in search of 
a job. On her return, she sits down, and in order to keep the worry off her mind 
practices on her pie-anno. Outside the window, a movie producer is watching 
her. He’s known her from childhood. He always loved her beauty and at- 
tractive personality. This was his chance. He would have her in his production. 
He walked through the open door. Cautiously, he stepped up to Fairybelle. 
His forehead was wrinkled. ‘ Fairybelle,” he said warningly, “‘ You know your 
husband is dead and you need work. I would like to have you in my screen pro- 
duction. You love to play the pie-anno and you could be the musician in my 
film.’’ Tears came to Fairybelle’s eyes as she thought of her dear husband. She 
would never see him again, but she must work and support herself. She accepted 
the offer. She is now a beautiful actress in the movies. 


While the emotional content of these stories is considerably varied, the flamboyant 
“soap-opera”’ bathos so striking in the Fairybelle story frequently is seen in the stor- 
ies of all five patients. This pseudo-emotionality usually is reflected more in the 
melodramatic style and histrionic language than in the story themes themselves. 
As in the other two sources of test data, an intellectualizing demeanor is character- 
istic of the responses. Here, however, the emptiness of emotionality is most manifest. 


DISCUSSION 


Anorexia Nervosa—A Psychological Entity? Whether or not there is essential 
unity of intellectual and emotional functioning in this disease syndrome cannot be 
definitively decided without more extensive study than was attempted here. The 
results of our study indicate that, while Anorexia Nervosa appears to form a syn- 
dromal unity as far as certain features of the patient’s life history and situations are 
concerned, only one aspect of psychological functioning was exhibited by all these 
five cases. They all demonstrate an emphasis on an intellectualized analysis of their 
environment in an emotionally detached fashion which leaves them partially out of 
touch with reality. Qualitatively, these five records resemble Rapaport’s“ over- 
ideational, schizoid normals. This disturbance of thought process also was noted in 
the behavior of these patients on the hospital ward and was characteristic of their 
behavior at home and at school. 


Some implications for test validity. When confronted with such varied psycho- 
logical results from such an otherwise remarkably similar group of individuals, one 
is tempted to seriously question the validity of these test results. It is true that the 
one suggestive pattern correlates well with observable behavior in life situations of 
these girls, but this explanation is only partially satisfying. For one thing, this 
general dynamic pattern is not specific to Anorexia Nervosa but is characteristic of 
several types of neurosis. One might expect such a specific ‘disease’ entity as An- 
orexia Nervosa to have an equally specific psychological unity. Secondly, this one 
observation as to patterns of thought-processes and emotional dynamics leaves un- 
answered the question as to the genesis of the disorder in terms of specific life prob- 
lems or even types of problems, as hypothesized from psychiatric data. 

Perhaps we expect too much of our present state of psychological knowledge 
when we demand that the observable life-behavior data and the projections of 
fantasy fit neatly into one pattern. ‘‘ Disease’’ entities, even when characterized by 
such uniform symptom syndromes, are not necessarily psychological entities, i.e., 
are not necessarily outer manifestations of the same emotional problems or even of 
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ways of thinking. It has been repeatedly demonstrated that overt symptoms, such 
as alcoholism or ulcers, may arise from any one of a variety of personality configura- 
tions. For example, Lhamon and Saul? have recently emphasized, “ Personality 
was thought of as a unit, and in early work in psychiatry and psychosomatic medi- 
cine attempts were made to correlate organic symptoms with aspects of the whole 
personality.’”’ Further, Alexander is quoted in a statement of present-day opinion— 
“What we find characteristic is not so much a certain personality type as a typical 
conflict situation which may develop in very different personalities.” The disorder 
under consideration in this study may thus be taken as a paradigm of this viewpoint. 

While projective techniques probably never can be validated against manifest 
behavior in the usual sense of the term validity, the relationship between the two 
kinds of behavior, and their divergences, would seem a legitimate object of investi- 
gation. The result of this investigation, i.e., finding a common pattern of thought 
process may be thus viewed as such a relationship, and neither as disproving the 
unity of the disease syndrome nor contesting the validity of the psychological tech- 
niques. 

It may be observed, however, that at least two of the techniques used are chiefly 
methods of analyzing thought process rather than being concerned with the content 
of the emotional problem. While the TAT often offers a wealth of data concerning 
the specific attitudinal patterns within the structure of the personality, it is still, in 
the opinion of these authors, too unrefined a tool for exact analysis. Perhaps we need 
more specific study of symbolic content of Rorschach and TAT data. Such investi- 
gation would, of course, demand a clarification of current psychological concepts 
about symbolic content. 

Without attempting in this brief paper to deal further with the difficult prob- 
lems of the validation of personality interpretation, one additional observation from 
this study may be allowed. While the phenomenon of the related life facts in these 
five cases was only scantily explained, it was possible to predict in every case the 
different outcomes of psychotherapy. This fact strongly suggests that the behav- 
ioral data of the projective techniques may more closely resemble the type of psycho- 
logical functioning occurring during psychotherapy than serve to explain a set of 
concrete facts in the individual’s life. 


SUMMARY 


Anorexia Nervosa is noted for its uniformity as a symptom syndrome. Five 
adolescent girls who showed this pattern of symptoms in six areas were studied in- 
tensively with three psychological techniques: the Wechsler-Bellevue Adult Intelli- 
gence Scales, Rorschach Technique and Thematic Apperception Test. An over- 
intellectual, over-idealized approach was noted in their thinking. Otherwise these 
five youngsters seem very different personalities. 

It was pointed out that this type of thought process does not distinguish An- 
orexia Nervoga from other neurotic symptom patterns. This study is viewed as 
further evidence that a direct relationship between manifest behavior and total 
underlying dynamics may not be expected. Investigation of the content of fantasy as 
dynamic prototypes of surface behavior is suggested as a means of relating “char- 
acter structure” with overt behavior. Successful prediction of the outcome of psy- 
chotherapy on the basis of the projective techniques suggests that the behavior 
sampled by these tests probably is more related to the emotional process of therapy 
than to any static set of life events. 
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A STUDY OF THE PSYCHOLOGICAL FACTORS IN PRIMARY 
DYSMENORRHEA 
FREDERICK I. HERZBERG 


University of Pittsburgh 


PROBLEM 
Dysmenorrhea, or painful menstruation, ranks as a major dysfunction among 
the women of today and offers one of the most difficult problems encountered in the 
practice of gynecology. Nothing said on the subject is free from contradiction. There 
is disagreement of opinion as to its incidence, cause, and treatment. The incidence 
of dysmenorrhea is reported by some to have increased in recent years. Hunter and 
Rolf® claim that the increase stems from the fact that modern mechanical in- 
ventions have relieved women of much drudgery, thereby giving them more time for 
introspection. On the other hand, Meredith™, Van Duyne®®, and Clow @?, on the 
basis of comparative studies over a number of years, report a sharp decrease in 
dysmenorrhea. They attribute this decline partly to a healthier attitude toward 
menstruation. Their studies, however, were conducted on too narrowly selected 
groups over twenty years ago. The exact prevalence today is difficult to determine 
because of the multiplicity of subjective factors which enter into the investigation. 
Medical literature abounds with discussion of the etiology and methods of treat- 

ment of dysmenorrhea. For each of the numerous causes that have been postulated 
there exists a variety of means to correct it. However, the results of therapy have 
been far from constant and satisfactory. Of the many suggested etiologies for pri- 
mary dysmenorrhea the following are the most frequently mentioned: 

(a) imbalance ia the normal estrogen-progestin equilibrium 

(b) hypoplasia of the uterus 

(ce) eadocrine imbalance 

(d) psychic factors 


The experimental evidence has not as yet supported any of these possibilities. 
The psychic factors have not been put to good experimental test. Wengraf“? sums 
up the psychogenic case for dysmenorrhea by offering the following characteristics 
of this disorder of menstruation: 

(a) absence of pathology 

(b) imeonstaney of pain 

(ec) reversibility of pain 

(d) presence of other neurotic stigma 

(e) samenability to a variety of treatments 
(f) suppression by psychotherapy 


The clinical evidence that mental factors may underlie or be a concomitant of 
menstrual pain has received little support from objective attempts at correlating 
the two variables to see if they are significantly related. This study is undertaken 
to determine whether there is a measurable personality deviation in women who 
suffer from primary dysmenorrhea. 


METHOD 

Subjects. The subjects were drawn from two Pittsburgh department stores, a resi- 
dence home for business women, and the Young Women’s Hebrew Association. It 
Was necessary to contact many more women than were used in this study in order to 
obtain sufficient, relevant and acceptable data. Only a small percentage of the pop- 
ulation of the places used were willing or able to cooperate. Of those who did, a large 
number were eliminated for failing to meet the criteria demanded by the study. An- 
other sizable number failed to complete all parts of the experiment. This was to be 
expected since in most cases contact had to be kept for over a month. Of the total 
number of women contacted, forty nine dysmenorrheal and fifty one non-dysmenorr- 
heal subjects were included in the analyses. 
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Procedure. A dysmenorrheal and a non-dysmenorrheal group of women of compar- 
able age, occupation, marital status and Binet vocabulary level were given the Min- 
nesota Multiphasic Personality Inventory (hereafter referred to as the MMPI). As 
far as possiblethe MMPI was given before the ovulative phase of the menstrual cycle. 
This was done because studies by Rosensweig®? and Altmann“? have shown that 
there are psychological concomitants to the various phases of the cycle. Both agree 
that the preovulative stage manifests the healthier mental state. Due to the difficulty 
of accurate charting of the subjects and the inability to contact some subjects at the 
right time, there were a number of exceptions to this procedure. However, the test 
was not administered during the five days preceding menstruation or during the 
menstrual period itself. These times both Rosensweig and Altmann again agree are 
the most undesirable from the point of view of psychological well-being. 

The diagnosis of dysmenorrhea was made on the basis of a questionnaire and a 
personal interview. The interview enabled the experimenter to administer the Binet 
vocabulary and provide an opportunity to clarify the questionnaire and gain some 
information about the personal history of the subjects. All subjects scoring below 
eighteen words on the vocalulary test were eliminated from this study. The cutoff 
point of eighteen words was arbitrarily set at one-half sigma below the mean of the 
standardized group of sixteen year olds presented by MceNemar“?. The equating 
of the groups on the vocabulary level and the elimination of those under eighteen 
was felt necessary because of some evidence that at least an average intelligence was 
requisite for the completion of the Minnesota Multiphasic Personality Inventory. 

Means, standard deviations, and standard errors of means for each of the nine 
scales of the MMPI were computed for the experimental and control groups. Critical 
ratios utilizing Student’s ‘‘t’’ were then computed to determine the amount and 
nature of the difference between the groups. 

The percentage of subjects scoring above seventy (considered to be outside the 

normal range) on any of the scales, together with the standard errors of the per- 
centages, were computed for the groups studied. Critical ratios between these per- 
centages were then determined, again utilizing the Student’s “‘t’’ as a test for sig- 
nificance. 
Diagnosis. Primary (essential, intrinsic, idiopathic) dysmenorrhea may be defined 
as painful menstruation in the absence of gross pathological conditions in the pelvic 
organs. Secondary (acquired, extrinsic) dysmenorrhea may be defined as painful 
menstruation as the result of pelvic lesion. 

A questionnaire calling for information on the subject’s menstrual history was 
filled out by the subjects on one of the days of their menstrual period in order to 
standardize the time and get the most accurate report. At the department stores 
this was carried on at the dispensary when an employee came for rest or medication 
for dysmenorrhea. The control subjects were gathered from visitors to the dispensary 
who came there for other purposes and from volunteers provided by the personnel 
manager. At the business girls’ residence home and at the YWHA both control and 
experimental subjects were volunteers. In all cases the questionnaire was completed 
during the menstrual period. 

Following the return of the questionnaire, the experimenter interviewed the 
subject eliciting information that would clarify and supplement the information al- 
ready given on the questionnaire. Some of the additional information was the sub- 
ject’s conception of and attitude toward menstruation (beginning and present), an 
idea of the location and type of pain felt, and circumstances under which it varied. 
The following are the criteria which determined the diagnosis of primary dysmen- 
orrhea for the experimental group and the criteria for control subjects: 

CRITERIA FOR EXPERIMENTAL GrouUP 
age of subject at least four years past age of her menarche and not 
older than thirty 
no children 
menarche between eleven and fifteen 
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report of more than a moderate amount of pain 
indication of at least three of the five behavioral manifestations of 
dymenorrhea called for in the questionnaire: 

(1) cancelled social engagements 

(2) used an analgesic 

(3) forced to lie down and go to bed 

(4) became nauseous 

(5) consulted a physician 
doctor’s diagnoses, if available 
physical appearance did not appear underdeveloped 
corroboration of the above from interview 


CrirertA For Conrrot Groupe 
age of subject at least four years past age of her menarche and not 
older than thirty 
no children 
menarche between eleven and fifteen 
report of little or no pain 
indication of no more than one of the five behavioral manifestations 
of dymenorrhea called for in the questionnaire, not including con- 
sulting a physician. 
(f) corroboration of the above from interview 


The basal age of four years past the menarche was chosen to allow sufficient 
time for the subject’s menstrual cycle to become operative with regularity. After 
thirty the possibility of acquired dysmenorrhea increases. The limits to the age of 
menarche were set at the normal range of the onset of menstruation in American 
women. Too marked a deviation from the mean age of the menarche may indicate 
endocrine imbalance and therefore suggest a secondary dysmenorrhea. The elimin- 
ation of parous women was again an attempt to cut down the incidence of secondary 
dysmenorrhea, as was the elimination of underdeveloped physical types. The allow- 
ance of ‘little’ pain for the control group was necessary because of the difficulty of 
differentiating the norma] minimal symptoms of menstruation from pain on the 
part of the subject. The interview helped greatly in deciding whether the subject 
was to be included or not. Organic dysmenorrhea could not be completely eliminated 
for even a pelvic examination may fail to do this. The procedures followed attempted 
to decrease as much as possible the incidence of secondary dysmenorrhea. 


RESULTS 

Table 1 presents critical ratios showing the dysmenorrheal group scoring sig- 
nificantly higher on four of the scales, three of them at the one per cent level of con- 
fidence. These latter scales are Hypochondriasis, Psychopathy, and Depression. 
At the five per cent level of confidence Psychasthenia was greater for the experi- 
mental group. Psychopathy, though not having the highest critical ratio, is perhaps 
the most significant since the mean for the experimental subjects was found to be 
more than a sigma above the general population. The dysmenorrheal women tend 
also to be more depressed and hypochondriacal. In addition they indicate psychas- 
thenic behavior to a greater degree than the non-dysmenorrheal group. 

Perhaps the clearest delineation between the groups is indicated by the very 
high percentage of experimental subjects scoring above the normal range on one or 
more of the scales. The difference in percentages scoring above seventy between the 
experimental and control groups is again significant at the one per cent level of con- 
fidence. The percentage of each group scoring above the critical score of seventy 
on any of the scales is presented in Table 2. 


Subjective Reports. The available interviews presented a wealth of qualitative in- 
formation of such a nature that the most psychologically naive would see the impli- 
cations that psychological factors played in the disorder studied. Unfortunately the 
time factor and the necessity of keeping the best rapport with the subjects pro- 
hibited the experimenter from taking verbatim notes. However, the gist of the 
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TABLE 1. Mean T-Scores, STaNDARD DEVIATIONS, AND “t’””? VALUES FOR SCALES OF THE MINNESOTA 
Mv TIPHASIC _PERSON ALITY Inv ENTORY FOR Grou PS Srv DIED 


— 
} 


Dysme nobel Group _ Non. Dysmenor rheal Group " 
Mean | Sigma | Mean Sigma 1% Value 

| 

| 





Hypochondriasis | 52.9 8.3 | 48.5: 7.54 2.74** 
Depression 56.49 mf | 50.72 8.19 
Hysteria | 55.31 9.66 51.92 69 1.94 

Psychopathy 61.98 53 | 56. 9.14 | 3.04** 


3.12** 


Masculinity-Femininity | 50.65 10 49.: 33 | 0.6: 


Paranoia 53.53 46 | 50.92 9.18 32 
Psychasthenia | 56.78 | 63 | #4 .82 | 2.06* 


Schizophrenia | 56.16 .92 | §4.7% 9.12 
Mania | 58.10 99 56.53 7.52 





~~ *Mean Difference Significant of 5% Level 

**Mean Difference Significant at 1% Level 
material gathered from the interview can be summarized from the following general- 
ized information that the dysmenorrheal women offered: 

(a) Many of the subjects suffering from dysmenorrhea have always consid- 
ered pain to be a normal concomitant of the menstrual period. These women often 
expressed surprise when advised that this was not true. 

(b) <A large number of the dysmenorrheal women experienced a definite shock 
upon the commencement of their menses. 

(c) The often expressed statement that women have always viewed their 
menstrual period as one of invalidism was again brought out by the interviews. 

(d) In explaining the fluctuation in their pain the subjects indicated that the 
pain ‘“‘had to go”’ when something more important presented itself. 

(e) More on the quantitative side was the fact that, of the five behavioral 
manifestations of dysmenorrhea called for in the questionnaire, foregoing social 
engagements was the one least admitted. 


TasBLe 2. Per Cent or Eacu Group FALuinc ABOVE 
T-Score 70 ON_ _ANy 0 OF THE NINE ScALEs. 


Pe r Cent 
Dysmenorrheal 53 
Group 
Non-Dysmenorrheal 18 
Group 
(This difference is significant at the one per cent level of confidence.) 





DIscUssION 

This study presents objective and quantitative data to substantiate the suspected 
psychological association in the menstrual disorder of primary dysmenorrhea. Of the 
four clinical syndromes that the dysmenorrheal women were “significantly higher on, 
psychopathy, by virtue of its high mean, seems to be the one most relevant. It is 
difficult to expl: iin why this clinical classification is the one most prominent among 
dysmenorrheal women. A possible explanation may be that the sexual area that 
psychopathic persons are so often troubled in is the same area which much of the 
literature has postulate d as the seat of the psychological difficulties of the dysmen- 
orrheal patient“: * "4. As for the other three clinical variables (hypochond- 
riasis, depression and psych: isthenia) in which there was a significant deviation be- 
tween the groups, the means were well within the range of normalcy and it is difficult 
to assign any meaning to them. 
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father than trying to pigeonhole the dysmenorrheal women into some psy- 
chiatric classification it is perhaps more indicative of the emotional instability of 
these women that more than half of the experimental group had at least one score 
above the normal range. This seems to present a more logical explanation of the 
psychological factors. Certainly having dysmenorrhea in common does not give these 
women common personalities, and though they may tend to be rather emotionally 
unstable as a group there are too many other variables that enter into the picture to 
determine the symptoms any one individual may manifest. 

As for the crucial question, which comes first, the psychological disturbance or 
the pain, the data revealed by this study can hardly offer an answer. A solution can 
possibly come only from an experiment in which induced emotional instability is the 
independent variable and the dysmenorrhea following is the dependent variable. 
The multitude of variables that may enter into such an experiment precludes the 
success of any such undertaking. Horney’s“? opinion that the material revealed 
from psychoanalysis is the only approach to the problem is a possible but far from 
crucial attack on the question. 

It is the opinion of the writer that the removal of negative values that are at- 
tached to all phases of menstruation on the one hand, and the recognition by physi- 
cians of the psychosomatic nature of the disease in their approach to the dysmenorr- 
heal patient on the other, will go a long way toward making dysmenorrhea a minor 
malfunction of women, 


SUMMARY 
1. This study was undertaken to determine if there were any measurable 
personality deviations accompanying primary dysmenorrhea. 
2. Two comparable groups of women; one indicating the presence of primary 
dysmenorrhea, and a control group consisting of women who were never bothered 


with menstrual pain, were tested for emotional instability by the Minnesota Multi- 
phasie Personality Inventory. 


3. The results indicated that the dysmenorrheal group scored outside the 
normal range, as set by the standardization population of the test, significantly more 
times on one or more of the scales than the non-dysmenorrheal group. There were also 
significant differences on the hypochondriasis, depression, psychopathic deviate, and 
psychasthenia scales in the direction of increased manifestation of the symptoms of 
these clinical variables for the experimental subjects. 

4. With due regard to the limitations of this experiment, the author feels justi- 
fied in viewing the results as evidence for the psychosomatic nature of primary dys- 
menorrhea. 
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A COMPARISON OF THE PERFORMANCE OF STUTTERERS AND NON- 
STUTTERERS ON THE ROSENZWEIG PICTURE-FRUSTRATION TEST 
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University of New Mexico 


INTRODUCTION 

Numerous authorities“: 2,3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 20, 21, 22, 23, 2%) in the 
field of speech maintain that psychological maladjustment is intimately related to 
stuttering. Of especial interest to the present paper, for example, is Abbott’s“ 
contention that unconscious guilt feelings, resulting from repressed hostility to the 
listener, may explain the reluctance of some stutterers to discard their secondary 
mechanisms during treatment, and that these mechanisms may fulfill the need of the 
stutterer for self-punishment as an atonement for his hostility. Also, speech correct- 
ionists have become increasingly aware of the importance of frustration in the etiol- 
ogy and development of stuttering. A recent publication by Van Riper @*), for ex- 
ample, contains at least 14 statements in which frustration is associated with stut- 
tering. 

If then, the hypotheses that (a) stuttering is a manifestation of some psycho- 
logical maladjustment and (b) frustration may be both a primary source and end 
product of maladjustment are accepted, then a study of the role which reactions to 
frustration may play in relation to stuttering is indicated. It is the purpose of this 
study therefore to compare the responses of a group of stutterers on the Rosenzweig 
Picture Frustration Study with a normative group to show any significant differences 
either in the direction of aggression or in the type of reaction to frustration. 


METHOD 
Twenty-five stuttering subjects were used for this study; 13 were high-school 
students, 7 were students at The University of New Mexico, and 5 were adults living 


in the city of Albuquerque. All subjects were 14 years of age or older, the range was 
from 14 to 59 years, with a mean of 23.3 years; 20 were males and 5 were females, a 
proportion commonly found. All of the subjects considered themselves to be stut- 
terers or former stutterers, an arbitrary rule having been set up to accept only in- 
dividuals whose stuttering had begun previous to their 12th birthday. This was con- 
sidered necessary to eliminate cases of very recent origin which might later have 
proved to have been only a temporary dysfunction. The material in Table 1 gives 
a description of the 25 subjects as regards sex, age, personal evaluations of stuttering 
and Rosenzweig P-F Study results. 

The materials used were the Rosenzweig test “®) and a brief subjective question- 
naire on stuttering. This type of questionnaire was chosen for two reasons: (a) a sub- 
jective evaluation by the stutterer of the severity of his stuttering was considered 
more pertinent to the purpose of this study than an objective evaluation; (b) in 
many cases the degree of severity of stuttering had reached its greatest point at some 
time in the past. Only the personal data and responses to three questions were used 
for purposes of this paper. The three questions were: “‘ At its worst, did you consider 
your stuttering to be mild, medium or severe?”; ‘‘Do you consider yourself to be a 
stutterer now?”; and, “If so, do you consider your stuttering at present to be mild, 
medium or severe?” 

The subjects were assured of the anonymity of their results, that the test was 
not one of intelligence, that there were neither right nor wrong answers, and for 
those in school, that the test had pu relationship to the school. The P-F Study was 
administered and scored according to the standard instructions given by Rosen- 
aweig “5, Only two replies were considered unscorable according to his standards. 

Since Rosenzweig has published data on a group of 460 normal subjects “®, his 
normative group was used as a basis for comparison. Male and female subjects were 
contrasted with Rosenzweig’s male and female subjects separately, and then the 
total stuttering group was compared with Rosenzweig’s total group. The stutterers 
were then divided according to severity of stuttering during its worst period, at the 
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present time, and also according to degree of improvement indicated. The latter 
measure, which is admittedly quite rough, was derived by contrasting the evalua- 
tion of stuttering at present with stuttering at its worst. A change from severe to 
medium, medium to mild, or mild to cured was considered one degree of improve- 
ment; from severe to mild, or medium to cured, two degrees, etc. Comparisons were 
then made of these sub-groups of stutterers with each other. 

All comparisons noted above were performed on the six categories of the P-F 
Study: Intropunitive, Extrapunitive, Impunitive, Obstacle-Dominance, Ego- 
Defense, and Need-Persistence, as well as on the Group Conformity Rating (GCR). 
The ‘‘t” test was used throughout to test significance of differences. 


RESULTS 
A. COMPARISON OF STUTTERERS WITH NORMALS 

Table 2 presents the results of the comparison of the stutterers with Rosenz- 
weig’s normative group for the sexes separately and for the total groups. It is im- 
mediately apparent that the stutterers, male or female or total group, are less Extra- 
punitive and more Intropunitive than the normal group. (Results for females should 
be evaluated in view of the very small n of five cases in this study.) Statistically 
significant differences, at the 5°, level or better, exist between male stutterers and 
normal males, and between total stutterers and total normals in the Extrapunitive 
category, and between stutterers and normals, regardless of grouping, in the Intro- 
punitive category. Since Extrapunitiveness and Intropunitiveness are opposites, 
these results are consistent. The Impunitive category yields nothing of significance 
in the way of differences between the two contrasted groups. 


TaBLe 2. COMPARISON OF STUTTERERS WITH ROSENZWEIG’S NORMATIVE GROUP 

Normative Group | Stutterers 
: Difference 
M=236 F224 M=20 | F=5 between 

Category sex | Mean 5.D. Mean D. Means 


M 
E-xtropunitive F 


13.3 38.47 6.5: 
Total 


13.1 | 35.48 : .52 


45 
45 : 
45 13.2 37.87 4. 7.13 


M 28 8.25 32.73 
Intropunitive F 28 35.34 
I Total 28 . 33.25 
M 28.81 
Impunitive F 29.18 
I Total + 8 28 
Obstacle- N 8 12. 
Dominance F : . 20.- 
(O-D) Total : 13 
kigo- M 53 3 3 5d. 
Defense F Hy 10.2 46.65 
(k-D) Toial 52.8 10.8: 55. 
Need- M y 10... 30 
Persistence , k 10.% 32 
(N-P) Total ‘ 10.: 30.82 


Group Conform- M Te 66. 
ity Ratio F 67 10 66 
(GCR) Total 67.5 11 


* Significant at the 5% level. 
** Significant at the 2% level. 
*** Significant at the 1% level. 
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We also find, according to Table 2, that normal males are significantly higher 
than stuttering males in the O-D dimension, as is the total normal group when con- 
trasted to the total stuttering group. The ‘t’s” here are highly significant, better 
than the 1°; level. Stutterers as a group indicate a higher score than normals in 
Need-Persistence, this difference being significant at least at the 5°; level. The E-D 
and GCR scores yield nothing of significance. 


B. Comparison OF VARIOUS SuB-Grovups OF STUTTERERS 

When various sub-groups of stutterers were compared, little of significance 
emerged. Thus, in comparing stutterers grouped according to severity of stuttering 
at its worst, or grouped according to severity at present, or according to self-reported 
improvement, all “t’s’’, excepting one, were insignificant. This ‘‘t’’ resulted from the 
comparison of self-ratings of stuttering at its worst; 4 cases rating themselves as mild 
were added to 13 self-rated as medium, and these 17 were compared to 8 self-rated 
as severe. The latter group, in the O-D dimension, had a mean of 17.75, whereas the 
former’s mean was 11.94, the “t” of this difference being 2.41, significant at the 5°; 
level. This is a reversal of the expected trend in that the mean of the severe group 
is closer than that of the less severe group to that of the normative group, whose 
mean was 21. However, this finding may be a result of either the subjective judg- 
ments used or our method of grouping or both. Further study of a much larger 
number of cases is definitely indicated. 

Discussion OF RESULTS 

Our most significant findings, from Table 2, are that stutterers as a group have 
a significantly higher mean score in Intropunitiveness, a significantly lower mean 
score in Extrapunitiveness, a significantly lower mean score in Obstacle-Dominance, 
and a significantly higher mean score in Need-Persistence. In other words, the 
normals (compared to stutterers) might be said to be Extrapunitive-Obstacle- 
Dominant, while the stutterers (compared to normals) might be said to be Intro- 


punitive-Need-Persistent. The Extrapunitive-Obstacle-Dominant category (or EL’) 
is described by Rosenzweig as follows (italics in all quotes ours): 

“The existence of the frustration is emphasized and the bystander sometimes made un- 
comfortable by the arousal ef his guilt or sense of responsibility. The paradigm for this 
scoring factor, expressed in a generalized response, is: ‘This situation definitely frustrates 
(annoys, inconveniences, disturbs) me.’ 8, p. 9 


The Intropunitive-Need-Persistent category (or 7) is delineated by Rosenzweig 
in these terms: 
“Reparation from an avowed or implied sense of guilt is offered; z.e., the subject takes it 
eee - ye, 
upon himself to solve the frustrating preblem. Paradigm: ‘I assume responsibility for remedy- 
ing this situation.’ ”’ ‘18, p. 10 


Thus Abbott’s® hypothesis noted earlier, that stutterers have a need for self- 
punishment appears to be verified. The fact that stutterers have been noted to have 
depressive trends“! ') and a keen sense of inadequacy “® also takes on fuller mean- 
ing. It is of interest, however, to compare Rosenzweig’s statements about E’ and 7 
with those of Fenichel regarding stutterers: 

“Psychoanalysis of stutterers reveals the anal-sadistic universe of wishes as the basis 
of the symptom. For them, the function of speech has an anal-sadistic significance. Speaking 
means first, the utterance of obscene, especially anal, words, and second, an aggressive act 
directed against the listener . . . Behind (the stutterer’s) apparent zeal he has concealed a 
hostile or sadistic tendency to destroy his opponent . . . and the stuttering is both the block- 

ing of and the punishment for this tendency.” 6, p. 31) 


He states further that the personality make-up of the stutterer is identical with that 
of the compulsion neurotic, in that they use their symptoms to gratify the demands 
of an oversevere superego. And, in discussing the “double front’’ of the ego in com- 
pulsion neurosis, he states further that: 
“The regression to anal sadism has not only modified the ego . . . it has also modified the 
superego itself, so that it becomes more sadistic and presents automatic and archaic features 
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.. . The sadism of the superego, resulting from the regression, increases the more the ego 
refrains from externally directed aggression. One might suppose that a person strict with him- 
self and outwardly unaggressive may be refraining from aggression because of his strictness; 
actually the blocking of (he aggression is primary and the strictness of the superego secondary ; 
the sadism, no longer directed against objects, is turned inward as the superego’s aggression 
against the ego.” (6, p. 91) 


Thus, it is possible, if the analysts are correct that stuttering is compulsive and 
anal-sadistic, and that this results in a “turning inwards of the superego’s aggression 
against the ego’’, then we may have a rational accounting for the significant differ- 
ences in Table 2 which we have found between stutterers and normals. 


SUMMARY AND CONCLUSIONS 

Twenty male stutterers and 5 female stutterers were compared on the Rosenz- 
weig Picture-Frustration Study with Rosenzweig’s normative group. The outstand- 
ing results of this study were: (a) Stutterers are found to be significantly more Intro- 
punitive than non-stutterers; (b) non-stutterers score significantly higher in Ob- 
stacle-Dominance; (c) stutterers show significantly higher Need-Persistence than 
the normative group. These findings seem to correspond to the psychoanalytic con- 
tention that stuttering is essentially compulsive in nature, with anal-sadistic tend- 
encies resulting in a turning inwards of aggression. When stutterers are classified ac- 
cording to severity of stuttering at its worst to severity of stuttering at present or 
according to degree of improvement, very little of significance or difference among 
these sub-groups emerged. 


REFERENCES 
Assort, J. A. Repressed hostility as a factor in adult stuttering. J. speech Dis., 1947, 12, 428- 
430. 
Brown, F. W. Stuttering: its neuro-physiological basis and probable causation. Am. J. 
Orthopsychiat., 1932, 2, 363-371. 
Cortat, I. E., in Hann, E. F. (ed.). Stuttering: significant theories and therapies. Stanford: 
Stanford Univ. Press, 1943. 
Despert, J. E. Stuttering: a clinical study. Amer. J. Orthopsychiat., 1943, 13, 517-525. 
Duncan, M. H. Home adjustment of stutterers versus nonstutterers. J. speech Dis., 1949, 14, 
255-259. 
FENICHEL, O. The psychoanalytic theory of neurosis. New York: Norton, 1945. 
Fiercuer, J. M. The problem of stuttering: a diagnosis and plan of treatment. New York: Long- 
mans, Green, 1928. 
FroscuE.s, E. Twentieth century speech and voice correction. New York: Philos. Library, 1948. 
GirrorD, M. F. Correcting nervous speech disorders. New York: Prentice-Hall, 1939. 
GreENE, J. S. in Haun, E. F. (ed.). Stuttering: significant theories and therapies. Stanford: 
Stanford Univ. Press, 1943. 
INGEBREGTSEN, E, Some experimental contributions to the psychology and psychopathology of 
stutterers: Am. J. Orthopsychiat., 1936, 6, 630-649. 
Jounson, W. Because I stutter. New York: Appleton-Century, 1930. 
Jounson, W. The influence of stuttering on personality. Univ. Ja. Studies in Child Welfare, 
1932, 5, No. 5. 
Jounson, W. The influence of stuttering on the attitudes and adaptations of the stutterer. J. 
Psychol., 1934, 5, 415-420. : 
Kravsz, k. O. Is stuttering primarily a speech disorder? J. speech Dis., 1940, 5, 227-231. 
Krovt, M. H. Emotional factors in the etiology of stuttering. J. abn. soc. Psychol., 1936, 31, 
174-181. 
Purcuir, 8. N. Why stammerers suffer. J. speech Dis., 1947, 12, 419-420. 
RosENzWEIG, 8. The picture-association method and its application in a study of reactions to 
frustration. J. Personal., 1945, 14, 3-23. 
RosENzWeEIG, 8. Revised norms for the adult form of the Rosenzweig Picture-Frustration 
Study. J. Personal., 1950, 18, 344-346. 
SoLtomon, M. Stuttering as an emotional and personality disorder. J. speech Dis., 1939, 4, 
347-357. 
SreEr, M. O., & Jounson, W. An objective study of the relationship between psychological 
factors and the severity of stuttering. J. abn. soc. Psychol., 1936, 32, 36-46. 
Tartar, G. Report of a case of stuttering as a problem of vocational readjustment. J. abn. 
soc. Psychol., 1928, 23, 52-58. 
Van River, C. Stuttering. Chicago: Nat’l. Soc. Crip. Child. and Adults, 1948. 
Witton, G. Howto overcome stuttering: a guide to speech control in conversation and public speak- 
ing. New York: Harpers, 1950. 





A COMPARISON OF PERSONALITY TEST SCORES AND MEDICAL 
PSYCHIATRIC DIAGNOSIS BY THE INVERTED FACTOR TECHNIQUE* 
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PROBLEM 


The problem of the quantitative representation of clinical data has been greatly 
simplified since the discovery of the inverted factor technique. It is now possible to 
define trait universes which can be sampled to describe, as a statistical distribution, 
certain aspects of personality which could be formerly dealt with only intuitively. 
The purpose of this study is to compare personality test scores and medical psy- 
chiatric diagnoses by the inverted factor technique which is concerned with a popu- 
lation of n individuals each of whom has been measured by m tests. 


EXPERIMENTAL DESIGN 


The author administered four paper and pencil personality tests including the 
Bernreuter Personality Inventory, the Bell Adjustment Inventory, the Guilford 
Martin Series, and the Minnesota Multiphasic Inventory to twenty five neuropsy- 
chiatric patients at the Mare Island Naval Hospital in northern California. The 
only criterion for selection of patients was that there be an equitable distribution of 
diagnoses and that all have an IQ of ninety or above on the Wechsler-Bellevue In- 
telligence Seale. All scores were converted into the T scores of the MMPI with a 
mean of 50 and standard deviation of 10. By use of the Pearson product moment 
correlation coefficient, a correlation table was set up using the patients as variables 
and the groups of tests as sample. By means of the centroid method of Thurstone, 
four factors were extracted, and rotated into meaningful positions. This is presented 
in Table 1. Case histories and final diagnoses of the patients were obtained which 
were a composite of the findings of a board of three psychiatrists. The extremely 
difficult task of naming the factors was accomplished by abstracting and “teasing 
out” from the case histories common psychopathological traits for those cases which 
had the highest factor loadings on each factor, and taking the common symptoms 
and determining which symptoms formed a syndrome. Tentative names for each 
factor were derived and finally a singke syndrome for each factor. 


DESCRIPTION OF INDIVIDUAL FacTrors 


Factor I. Surveying the symptomatology of both types of cases (positively and 
negatively loaded cases), the following facts can be observed. Although the range 
of symptoms is wide on the positively loaded cases, there is a lack of definite psy- 
chotic signs. These symptoms could occur in a wide variety of syndromes but sug- 
gest anxiety neurosis. The symptoms on the negatively loaded cases, on the other 
hand, appear to fit the psychopathic personality pattern, but more in the psychotic 
domain than the nonpsychotie (e.g. hostility, confusion, delusions of self-reference, 
inability to get erection). The question whether severe anxiety neurosis and psycho- 
pathie personality (psychotic) are on opposite poles of psychiatric classification is a 
moot one. A review of the literature of the clinical syndrome ‘‘psychopathic person- 
ality” reveals considerable variability in description © ®& 7) 8%, 
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A COMPARISON OF PERSONALITY TEST SCORES 
TABLE 1. Resutts oF CENTROID ANALYsIS OF CORRELATIONS 


Original Factor Loadings 


Rotated Loadings 
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192 5 099 131 4280 20 Os 57 . 4100 
‘The symptom of anxiety was common to both the positively and negatively 
loaded cases in several instances. There were a striking number of cases on the nega- 
tively loaded portion of this factor in which anxiety is present and yet anxiety is not 
unknown in the irresponsible psychopath. However, it was felt that additional terms 
were needed to qualify the two syndromes. Consequently, Alexander and French’s 
psychoanalytical terms ‘‘undercontrol’’ and ‘‘overcontrol’’ were used as additional 

descriptive terms ©, 


Factor IIT. An inspection of the case histories and summaries of the positively 
loaded cases on this factor revealed that all cases were either psychotic or nearly so. 

Case * 18 was the only negatively loaded one of the group on Factor II. Al- 
though he showed schizoid trends, ‘‘on admission he was in excellent touch with 
reality and was correctly oriented”. He showed normal psychomotor activity and 
conversation was coherent, relevant and well organized. This patient was the one 
in this group who was in almost complete remission, and although possessing a psy- 
chotic diathesis, would be the type who at the time of taking these tests would 
answer a paper and pencil test as if he were normal. 

The choice of naming this factor resolved around three similar titles: (1) Severe 
paranoid tendencies, (2) Delusions of self reference, or (3) Delusions. The first title 
is suggestive of either paranoid schizophrenia or paranoia. Because Case *# 12 did 
not display this symptomatology and Case #1 did, but only to a slight degree, this 
title was discarded. Likewise, all the cases did not display delusions of self-reference 
characteristic of paranoid states. All positively loaded cases did display delusional 
tendencies with the possible exception of Case #12. However, although his be- 
havior is more characteristic of the hebephrenic schizophrenic, the disoriented be- 
havior and poor contact with the environment would bring him close to the delusion- 
al frame of reference. Consequently the title ‘“‘delustons” was given to this factor. 
Factor III, All cases on Factor II were positively loaded. Although the similarity 
in symptomatology is not as clear cut as on Factor II, the pattern is fairly clear. 
In order to obtain tentative names for this factor, it was necessary to scrutinize the 
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early neuropathic traits of the patients in this group. All of them showed traits 
deseribed by Stern?’ in the neuropathic constitution syndrome. The tentative 
names of this factor were listed as follows: (1) Regression, (2) Feeling of inadequacy, 
(3) Physical symptoms (enuresis, headache, perspiration ete.), (4) Withdrawal, or 
(5) Anxiety. 

The most outstanding case of regression on this factor was Case * 25 who also 
exhibited the physical symptoms of enuresis (which also is a regressive character- 
istic). Case #25 has the highest loading on this factor and consequently this factor 
was called ‘regression’ with concomitant symptoms of feelings of inadequacy, with- 
drawal, and anxiety. It was believed permissable to do this because all such symp- 
toms are often auxiliary to and concomitant with regression and form a common 
core syndrome. 


Factor IV. Factor 1V was unique in that it had three cases which were also present 
on another factor (Cases * 20, 11, and 21 on Factor 1), and also differed from the 
other factors in being less well defined. All cases were positively loaded, although 
Case *7 was the only one above 50. Four of the cases were diagnosed as schizo- 
phrenia and two as anxiety neurosis or anxiety states. 

The choice of naming this factor was probably the most difficult of the whole 
group of four factors. A schizophrenic trend was seen to run through all the cases 
except the two anxiety states. All patients, however, were withdrawn from reality 
and although the ‘‘anxiety patients’? were not overtly psychotic, they possessed a 
psychotic diathesis. Two of the patients were described as being well oriented in 
place, space and time while the other four were detached from reality. In view of the 
marked withdrawal of the majority of patients, this factor was named ‘‘schizo- 
phrenia” with accompanying symptoms of detachment from reality and anxiety. 


ADDITIONAL METHODS OF VALIDATION 
The choice of syndromes for naming each factor was further checked by two 
statistical methodologies, viz. (1) A study of the MMPI profiles of all cases on a 
given factor and comparing the composite profile for each factor with profiles of the 
same syndromes reviewed in the literature, and (2) Obtaining the test weight for 
each subtest on each factor by means of the beta weights for multiple r’s and ascer- 
taining which subtests had the highest factor scores. 


MMPI Profiles. The choice of the MMPI profiles to validate the given syndromes 
was made on two bases. The first was that more work has been done on the validity 
of the MMPI than any of the other paper and pencil personality tests and the second 
was that this is the first test of this type upon which profile analysis has been done. 

In carrying out profile analysis on the MMPI, it is helpful to distinguish three 
aspects of the pattern: first the phasicality or number of peaks observed; second the 
slope of the curve, whether descending, neutral or ascending; and third the overall 
elevation. The clinical scales tend to group themselves into two clusters, the ‘‘neuro- 
tie triad” including Hs, D and Hy seales (hypochondriasis, depression and hysteria), 
and another cluster including the Pa, Pt and Se scales (paranoia, psychasthenia, and 
schizophrenia) or ‘“‘psychotic triad’. In the character disorders, this is less frequently 
observed, the more common pattern being an elevation on Pd (psychopathic de- 
viate), a secondary plateau on Pa, Pt, and Se scales. In most neurotic and psycho- 
tic groups both clusters will be elevated, giving a bimodal or diphasic appearance 
to the profile. 

The slope of the curve can be estimated by noting the relative position of the 
neurotic triad in relation to Pa, Pt and Se. If the latter are higher, then the slope is 
positive; if lower the slope is negative. For neurotic groups, the slope is usually 
negative; for psychotic groups the slope is usually neutral or positive. 

Finally the overall elevation should be considered. The higher the curve, the 
more serious the underlying disorder. Thus a severe psychoneurotice condition would 
be suggested by a diphasic curve, of negative slope and reduced elevation. A psycho- 
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tic condition would be suggested by a diphasie curve of neutral slope and moderate 
elevation. Gough has recently done a great deal of research on the profile analysis 
of the MMPI, and has estimated what he calls ‘typical’? psychopathological curves 
or profiles. These were compared with the empirical results of this study. 


Empirical Profiles in this Study. MMPI profiles were computed from the standard 
scores (T scores) of the MMPI on each case for each factor. Composite profiles were 
computed for all cases positively loaded and also cases negatively loaded on this 
factor. This was done by merely obtaining the mean of each of the scales (the ?, L, 
KX and F seales were omitted since they do not contribute as much to the diagnosis 
as the remainder of the scales) on both negatively and positively loaded cases on a 
given factor. In comparing Gough’s profiles with those determined empirically, the 
names of the reviewed profiles of Gough were chosen on the basis of those which most 
nearly resembled the nosology of the individual factors (both negatively and posi- 
tively loaded portions). 

[t was found that the shapes of the profiles on all factors were similar, although 
on some factors the actual numerical values for each of the scales (empirical and re- 
viewed) are different. Although Gough has not computed a typical normal profile, 
he has stated that normal profiles do not show phasicality. The negatively loaded 
case on Factor IL was in almost complete remission. This profile showed practically 
no phasicality. Factor [IV more than any other profile was a composite of a variety 
of psychiatric syndromes. However, two core syndromes stood out over all others. 
These were a schizophrenic syndrome (psychosis) and anxiety (a psychoneurotic 
syndrome). Consequently, a composite profile was made from Gough’s psychotic 
and psychoneurotie profile. The profiles of the reviewed and empirical cases on this 
factor were quite similar in shape. 


Test Weights on Each Factor. A second additional check was to obtain factor scores 
for each of the subtests of the cases on a given factor by means of the Doolittle solu- 
tion. The rotated loadings were used as the criterion correlations. These beta 
weights were put on a simple scale using a single whole number. Each beta weight 
was multiplied by the standard score of each person in that-subtest. This procedure 
was repeated for each of the subtests on the factor. These summated scores for each 
of the subtests showed rather clearly the relative position or standing of each of the 
subtests on the factor. In order to make comparison more easy, the distribution 
of total scores for each factor was grouped in quartiles of the range and the subtests 
in each of these quartiles were tabulated so as to enable comparison of subtests on 
each quartile. 

An inspection of the comparative weights of subtests on each of the factors re- 
vealed that the factor scores results substantiated the naming of each of the factors. 
It is significant that in both Factors I and Il the B,D of the Bernreuter was.in the 
fourth quartile of the range, characteristic of withdrawing people and those having 
difficulty with their environment. Factor LV was the only one whose syndromes 
were not consistent with the findings of the subtest scores on the factor. There is no 
apparent explanation for this except that the syndromes on Factor LV were so diffuse 
and inconsistent as to make naming extremely difficult. 


SUMMARY AND CONCLUSIONS 


1. Four paper and pencil personality tests were administered to twenty-five psy- 
chiatric patients at the Mare Island Naval Hospital in northern California. Raw 
scores on each of the subtests were converted to standard scores and a correlation 
table set up so that the scores of each patient were compared with the scores on the 
same subtests respectively of every other patient. 

2 


The results were submitted to factor analysis by means of the centroid method 
of Thurstone and four factors were extracted. These four factors were rotated into 
meaningful position. 
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3. Case histories and final diagnoses were obtained on each patient. Common 
psychopathological symptoms were obtained for each of the factors by abstracting 
from the case histories of the individual patients their symptoms. 


4. It was found that the symptomatology of the cases on each factor (both nega- 
tively and positively loaded cases) were similar enough to make tentative interpre- 
tation. These interpretations were then checked by two statistical methodologies, 
i.e. a comparison of the composite profiles of both negatively loaded cases and 
positively loaded cases with the corresponding MMPI profiles found in the literature, 
and obtaining factor scores for each of the subtests. In general, the results of the 
MMPI profiles and the comparative beta weights substantiated the results of the 
inverse factor analysis. 
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INTRODUCTION 


One of the more difficult problems faced by the staffs of large mental hospitals 
is the care and treatment of long-term regressed schizophrenics. These patients, 
bearing an almost uniformly poor prognosis, require an excessive amount of cus- 
todial care. The chronic shortage of personnel in large hospitals makes intensive 
individual treatment of these patients an almost impossible task. Many state 
hospitals have tried various forms of activation programs. Though these have met 
with some success, the large number of patients and the too few recreational and 
occupational therapists have made for only spotty progress. 

Two studies of intensive work with a small number of regressed patients at a 
time have been reported in the literature. Yoder“ reports a socialization program 
for hebephrenic schizophrenics. The mean length of psychosis in his group was three 
years. He employed a control group which was not treated. He states that the 
socialization group ended up with two recoveries, eighteen cases of improvement, 
and four unimproved cases, while the control group contained no recoveries, ten 
cases showing improvement, and fourteen cases showing no improvement. Myer- 
son? and Tillotson“? reported the results of a total push program. (The term total 
push was coined by them.) Their program consisted of four parts: (1) general medi- 
cal push, including physiotherapy and hydrotherapy; (2) exercise and games; (3) 
diet and vitamins; (4) psychological, consisting of new clothing and the use of pun- 
ishment, reward, blame, and praise (presented in the order of frequency of use). 
They report that all of their patients improved considerably, but that none re- 
covered. 

The twin factors of poor prognosis and large number of patients make extensive 
treatment of this kind impractical for use with all regressed schizophrenics. It was 
hoped in the Fergus Falls program that certain prognostic factors might emerge 
that would make for a better selection of patients for the total push type of treat- 
ment. Secondly, though recovery of the patients was held as a highly desirable goal, 
the main aim of the program was to make for better hospital adjustment of the 
patients. 

PROCEDURE 

Subjects. A pool of 200 patients was selected from continued treatment wards. 
The personnel selecting these patients were head nurses, psychiatric aides, super- 
visors, and ward physicians. The criteria for selection were (1) having been hospital- 
ized five years or longer; (2) seclusive and uncooperative ward behavior; (3) lack of 
participation in routine hospital work and recreation; (4) general untidiness; (5) 
being between the ages of thirty and fifty, and (6) a diagnosis of schizophrenia. 

Thirty male and thirty female patients were selected from this pool to be the 
experimental group. They were selected by taking every third patient until the de- 
sired N was achieved. Twenty of these were placed on the program first for a period 
of six months. The other forty were put on after the first six month period had been 
completed. (The situation for both groups was nearly identical. The statistical 
evaluation of data gave the same results for both groups. Therefore it was decided 
to treat them as a single group for convenience in this paper.) During the course of 
the year long program, 6 subjects were dropped for various reasons, which left a 
final N of 54. 

A control group of 60 was selected to match as closely as possible the behavior, 
age, sex, admission age, and length of hospitalization of the experimental group. 
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This group was to receive no therapy. During the course of the program, seventeen 
of these had to be eliminated as they either received some form of therapy (ECT, 
lobotomy, ete.) or were transferred to other hospitals. The final control group N 
was therefore 43. 


Criteria. Two types of criteria were used ; one fairly qualitative, the other 
quantitative and therefore susceptible to statistical treatment. The qualitative 
criteria were two; progress notes written by the nursing staff and the ward physi- 
cians, and work records as reported by the various departments and the nursing staff. 

There was but one quantitative criterion, the L-M Fergus Falls Behavior Rating 
Seale “’—hereafter referred to as the L-M Scale. This is a five point scale measuring 
behavior in eleven different areas. It is reliable and objective enough so that varia- 
tion in behavior ratings measure differences in patients’ behavior rather than in the 
raters’ subjective states. All ratings here mentioned are averages of the eleven areas. 
The experimental group was rated once weekly for six months, and the control 
group Was rated during the first and last weeks of the six month period. 


Organization of total push. While this aspect of the Fergus Falls program is to 
be reported in detail in the psychiatric literature, an outline of it is necessary here.! A 
psychiatrist advised procedure at all times. A psychologist was in charge of both 
the men’s and women’s wards. One staff physician was responsible for the physical 
care and treatment of the patients on each ward, and made the final ruling before 
any patient was approved for shock treatment or the more strenuous recreational 
and occupational activities. A registered nurse was in charge of the immediate 
ward activities and routine. One psychiatric aide II and at least two psychiatric 
aides | were on duty at all times. In addition to these regular personnel, occupa- 
tional and recreational therapists were regularly assigned to the wards for at least 
half of each day, and-3 student nurses were on duty for most of each day. 

All the personnel were engaged in social activation of the patients. After the 
first 6 to 7 weeks, during which all total push patients received 20 electro-convulsive 
treatments, the emphasis was on games, simple at first, and later developing into 
relatively organized group activities. With the given number of personnel, a 12 to 
14 hour schedule for each day for 6 months could be followed quite efficiently. What- 
ever individual activities could be carried on were centered about the patients’ pre- 
psychotic interests which had been investigated by the Social Service Department 
either in personal interviews or by correspondence with relatives. 


Experin ental design. All of the patients were on the total push wards for at least 
one week prior to the beginning of the series of electro-shock treatments. At the 
end of this pre-shock week each patient was rated by two aides or nurses using the 
L-M Scale. In this way the beginning behavioral status was ascertained. Each 
week thereafter, every patient was rated independently by two persons working on 
that ward. The average behavior rating for the last 4 weeks of the program was 
taken as the terminal level of behavior for each patient. 


RESULTS 

Empirical. At the beginning of the program, only 4 patients contributed any 
work to the hospital, at the end 13 were working, and 2 of these were holding down 
skilled jobs. At the beginning 22 patients had to have help in being clothed, at the 
end this number was reduced to 5. Prior to total push 32 patients were entirely mute, 
at the end only, 9 continued to be mute. At the beginning 36 patients were incontin- 
ent, this number was reduced to 23. At the beginning 43 patients were entirely 
seclusive, at the end 22 continued so. One male patient made a social recovery fol- 
lowing total push. Two other patients (one male and one female) made social re- 
coveries following lobotomies. How much total push contributed to their recoveries 


‘All patients receiving lobotomies are assigned post-operatively to the total push wards also. 
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has not been determined. Further research will investigate this problem, as well 
as the problem of how much ECT contributed to the total improvement. 


Statistical. As was mentioned earlier, the main therapeutic purpose of the total 
push program was to raise the behavioral level of the patients to a point at which 
they would not require excessive care. It was decided empirically that this should be 
the mean behavioral level of the entire population of the hospital. Numerically, 
the hospital mean is 2.9 on the L-M Seale. At the beginning, none of the experi- 
mental group stood at the hospital mean. At the end, 12 (6 men and 6 women) had 
reached or exceeded it. No members of the control group were at the hospital mean 
at the beginning, and none reached or exceeded it at the end. The 12 patients with 
final ratings above 2.9 had an average improvement rate of 1.15+.39. Only one 
patient in the control group showed improvement of 1.00. 

The mean behavioral level at the beginning of total push for the entire experi- 
mental group was 1.81+.59. At the end it was 2.27 +.76. There were no significant 
sex differences. The standard error of the difference was .11, yielding a K of 4.36, 
significant beyond the .001 level. The control group’s beginning mean was 1.85 +.39. 
Six months later its mean was 2.09+.55. The standard error of the difference was 
.14, yielding a Kk of 1.67 which is not significant. 


Prognostic factors. The following factors were investigated in the hopes of de- 
termining whether they have any relationship to improvement on total push: behav- 
ioral level at the beginning of total push, length of time spent in the hospital, age on 
first admission, diagnosis, number of letters to the superintendent inquiring about 
the patient during the first 5 years of hospitalization, education, marital status, re- 
ligion, and age when beginning total push. Of these factors, marital status was not 
useful because nearly all the patients were single. Education was not indicative 
because nearly all had 7 or 8 years of education. Religion was of no value because 
the overwhelming majority of the patients were of Scandinavian Lutheran faith. 

Those patients who improved most had better behavior when they began total 
push. The improved group had a mean beginning behavior rating of 2.02+.18, 
while the unimproved group had a mean beginning behavior rating of 1.75 +.34. 
This is a difference of .27+.09 with a Kk of 3.00. This is significant beyond the 
.O1 level. 

Those patients who improved had been in the hospital for a shorter time. The 
mean for the improved group was 7.92 years +3.42 years. The unimproved group 
had been in the hospital an average of 11.36 years +3.82 years. There is a differ- 
ence of 3.44 years +1.19 years. The Ix in this case is 2.89 which is also significant 
beyond the .01 level 

The diagnosis of the patient showed a relationship to the amount of improve- 
ment. There was a significant difference between the amount of improvement 
shown by those diagnosed as paranoid or simple types of schizophrenia when com- 
pared with schizophrenics of the hebephrenie type. The mean improvement of each 
of the four classical breakdowns was as follows: paranoid .64+.57, simple .62 +.48, 
catatonic .47 +.78, and hebephrenic .17 +.07. The differences which were significant 
were those between paranoid and hebephrenic, and simple and hebephrenic types. 
Paranoid vs. hebephrenic improvement equalled .47+.15, which yields a t of 3.13 
which is significant beyond the .01 per cent level. Simple vs. hebephrenic improve- 
ment was .45+.16 with a ¢ of 2.81 which is also significant beyond the .01 level. 

Another difference was noted which, though suggesting a trend, was not statis- 
tically significant. The improved group was older at the time of first admission than 
the unimproved group. The difference in mean ages was 3.21 years +2.48. The 
KX here was only 1.29 with significance short of the .20 level. 

The following two factors showed no relationship to the patients’ improvement: 
number of letters to the superintendent inquiring about the patient in the first 5 
years in the hospital, and age when beginning total push. 








J. O. SINES, RUBEL J. LUCERO, GORDON R. KAMMAN 


DISCUSSION 


Two things emerge rather clearly from this study. The first is that total push 
as utilized at Fergus Falls is of benefit to a goodly number of regressed schizophren- 
ics. However, it is too expensive a treatment to be applied to all regressed patients, 
as the returns in general are not sufficient to justify its use. 

When reaching or exceeding the mean institutional behavioral level is used as a 
criterion, it is found that only 22 per cent of randomly selected patients show enough 
improvement to justify the use of the treatment. It is encouraging to find that 
certain factors do distinguish between those who reached or exceeded the institu- 
tional mean and those who did not. Although the overlap on all factors is rather 
large, a judicious use of a combination of them to select patients for total push 
might well result in much more positive results. A new group has been selected at 
Fergus Falls by using these factors in the order of their significance. This group will 
be reported on at a later date as well as a follow-up study of the original group. 

There are a few theoretical implications in the study. That the less regressed a 
patient is, and the less time that he has been hospitalized, the better is his prognosis 
constitutes confirmatory evidence for two widely accepted hypotheses. That the 
simple schizophrenic bears a better prognosis than either catatonic or hebephrenic 
schizophrenics, runs counter to a widely accepted hypothesis, i.e., the simple schizo- 
phrenic bears the poorest prognosis of any of the schizophrenias. It may be that 
prognosis has been defined in terms of social recovery only, and that good institu- 
tional adjustment has not been considered by the proponents of this hypothesis. la 
this connection it is interesting to note that the only social recovery was made by n 
patient in whom simple schizophrenic symptomatology predominated. There are 
also two schools of thought regarding prognosis with respect to age at first break- 
down. The present-study supports the hypothesis that a person who has been 
integrated over a longer period of time has a better prognosis than someone who 
breaks down at an early age. 


SUMMARY AND CONCLUSIONS 


A group of 54 randomly selected regressed schizophrenics was placed on a 6 
month total push program at Fergus Falls State Hospital. A greater number of per- 
sonnel were available to work on this program than were employed with any other 
group of similar numbers in the hospital. All patients received a series of 20 electro- 
convulsive treatments during the first 6 to 7 weeks of the program. The L-M Fergus 
Falls Behavior Rating Scale was used to determine the beginning status of the 
patients and was continued in use weekly during the entire program. A control 
group of matched patients was similarly rated at the beginning and at the end of a 
6 month period. There was greater improvement for the total push group than for 
the control group during the experimental period. There were several factors which 
showed some relationship to the criterion of improvement and several others that 
did not. 


The conclusions drawn from the obtained results are: 
(1) Total push, as organized at Fergus Falls State Hospital, brings about improve- 
ment in some regressed schizophrenics. 
(2) ‘The factors of interest of the family as measured by the number of letters con- 
cerning the patient in the first 5 years and present chronological age are not related 
to improvement on total push. 
(3) The higher the original behavioral level, the better the prognosis (for improve- 
ment on total push 
(4) The shorter the length of hospitalization, the better the prognosis. 


(5) Patients diagnosed as schizophrenics of the simple and paranoid types have a 
better prognosis than patients diagnosed as schizophrenics of the hebephrenie type. 
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A diagnosis of catatonic schizophrenia does not differ significantly prognostic-wise 
from either of the above. 
(6) <A trend, not statistically significant, indicated a better prognosis for patients 
who first became mentally ill when older than for patients who became ill at a 
younger age. 
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THE INFLUENCE OF BODY POSITION ON MENTAL PROCESSES* 
HUGO G. BEIGEL 


Long Island University 


PROBLEM 


In the course of experimental work as well as in the clinical situation the writer 
noted evidence suggesting that the physical position assumed by the subject or client 
appears to influence the expression of thoughts and feelings. To test this possibility 


the following experiment was designed: 

A stimulus calling forth interpretation and other responses was to be applied 
once when the subject was sitting, once when he was standing, and once when he was 
reclining. Since the associations occurring in one situation had to be prevented from 
carrying over into the next, it was necessary to eliminate memory traces produced 
in any one of these steps. Because only in hypnosis can temporary amnesia without 
impairment of the normal functions be attained, the experiment was made with the 
aid of this tool. 

On different occasions volunteers were asked to interpret plates of the Thematic 
Apperception Test (not, as is usual, to tell a story) or were shown letters addressed 
to them that called for a decision. After they had given their responses in one posi- 
tion, amnesia was induced, the position was changed, and the same picture or the 
same letter was presented anew. The sequence of positions was varied. For Subject 1 
it was Reclining (R), Sitting (8S), Standing (ST); for Subject 2 it was ST, R, §; for 
Subject 3,8, ST, R, ete. While this method cannot prevent all emotional residua— 
if there should be any—from affecting the emotions and thoughts in the subsequent 
apperceptive process, it balances out the effects of positional patterns which might 
have resulted if the order of the positions had been the same for each subject. The 
purpose of the experiment was not explained to the subjects until the whole series 
had been completed, nor did those subjects who witnessed an experiment know that 
they would be used for a similar one. In addition, experiments requiring interpreta- 
tion of a picture were alternated with experiments in which a letter was used. 


Resutts Wire TAT Carp 8BM 
In the experiment described here six subjects were asked to interpret the same 
plate (8BM of the TAT) and six were asked to respond to a personal letter, five un- 
*This paper was read at the scientific meeting of the Society for Clinical and Experimental 
Hypnosis in New York, September 1951. 
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married men to identical letters supposedly written by their girl friends, a married 
man to an anonymous letter concerning his wife. No questions were asked except 
those noted below in the text, although it was sometimes necessary to prod the sub- 
ject by making affirmative sounds or by repeating his last statement. 

Although all experiments (twenty-two in all) resulted in the same reaction pat- 
terns, only twelve are presented here. The remaining ten were eliminated either be- 
cause their records were incomplete, or amnesia was not perfect (the subject re- 
membered having seen the card before), or because the stimulus material was ap- 
plied to only one subject. 


Subject 1, male, age 20; order of positions: R, 8, ST. 

R: The boy is thinking about a wounded soldier. He believes the soldier is his 
father. The father is telling his son about being wounded in service. 
Someone is hurt. He is in terrible pain. The man there looks like a doctor. 
But he isn’t a doctor. If he were he would wear a mask. And a hat. And gloves. 
The boy is the central figure. He must be thinking about what is happening. 

: The boy read a story about gangsters. One of them has been wounded. They 
took him to a doctor. No, that’s not a doctor. He isn’t masked. The gang- 
sters couldn’t take him to a real doctor. 


Subject 2, male, age 25; order of positions: ST, R, 8. 

R: The boy is thinking that some day he will be a doctor. I don’t understand the 
gun. ‘Lhe boy was playing and stopped to think of himself as a surgeon. 
Someone is sick. The boy is waiting and wondering what’s going on. 

: A fellow was hunting. He was shot at. He was rushed to the hospital. 


Subject 3, male, age 21; order of positions: 8, ST, R. 

R: A youth’s thoughts are depicted in the background. He is in trouble. A surg- 
eon is doing an abortion on a girl. The gun means suicide. Going away from 
everything. The door too means going away from everything. 

There is a cadaver on the table for the old medical operator. It might be a 
dream of the future. The boy is thinking. Why is a door there? The fellow 
looks like Pasteur. 

: The boy wants to go hunting. He’s thinking about slaughtering an animal 
or hurting someone physically. The doctor will then have to mend what he 
has done. 

eo 


Subject 4, female, age 25; order of positions: R, ST 


RK: The boy is thinking of an operation. Possibly his own delivery. The woman 
seems to suffer very much. But it is not giving birth. It is a man, I believe. 
He has tried to commit suicide. It’s the boy’s dream. 


It’s a boy’s fantasy or dream. He wants to become a doctor and perform an 
operation. This is an emergency. Very primitive circumstances. 

ST: Someone has been shot. The bullet is being taken out. This is a nightmare. 
The boy sees himself as a hero. He must perform the operation under any 
conditions. 

Subject 5, male, age 20; order of positions: R, 8, ST. 

Rk: The doctor looks familiar. Scene is like an old time movie. There is a crude 
sort of operation. I don’t know. 

The window is all... it’s all broken down. There is a bottle ...a doctor... 
poison. But then why ... I don’t see any connection. 

ST: It doesn’t make sense. They cut a man open. He may have swallowed some- 
thing. I once swallowed a penny. They gave me something to make me 
vomit ... I don’t feel well. (Has to be awakened.) 
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Subject 6, male, age 20; order of positions: ST, 8, R. 

R: A boy, a young man. In the background his thoughts. Some day he’ll become 
a surgeon. At that time the art of surgery was very young. Light and instru- 
ments are all wrong. The doctor’s head is not covered. This seems to be 
barrel of a gun. He is trying to decide whether he should be a soldier or 
doctor. Or he is thinking of someone who was a soldier and injured and is 
undergoing surgery. 

A person being cut open. There are two men. A boy in bed. Now I ean see 
better. It’s not the boy’s thoughts. The doctor is thinking of his son. 

: These are distorted thoughts. They are distorted. The boy is horrified. He 
has a horrified look on his face. He is horrified because of the men and the 
gun. He has an expression of concern. He must make a d®cision. 


If we compare the responses made in the standing with those made in the re- 
clining position, a definite pattern seems to emerge. The tendency toward activity 
and sometimes toward aggressiveness is strongly expressed in the standing position, 
in contrast to the remarks made in the reclining position. There is, in the standing 
position, nowhere meditation about the past or the future, as seems to be character- 
istic of the reclining position. Thought turns into intention (the boy wants to go 
hunting). In the one case (Subject 6) in which the action in the picture is described 
as thoughts, there enters an element of activity: the boy must make a decision. In 
four out of the six cases the interpretation in the standing position starts with an 
actual happening. When the subject describes action in both positions, it is more 
vigorous and violent in ST. Thus, the boy who was thinking about a wounded 
soldier (R) follows a ganster plot in ST. The boy who stopped himself in his play to 
dream about his future (Subject 2, R) is no longer mentioned; instead a hunting ac- 
cident is described (ST). The difference in the thought processes in this case is such 
that the subject cannot even fit the gun into their pattern (R). This seems the more 
remarkable since in this case the reclining. position succeeded the standing position. 
In the interpretations given by Subject 3—where an amount of aggressiveness is 
noticeable throughout—the degree increases from R to 8 to ST, although the se- 
quence in the experiment was 8, ST, R; he speaks of slaughtering animals and hurt- 
ing someone physically. Subject 5, though he has difficulty in rendering a coherent 
interpretation, resorts (ST) to a real incident in his life. the memory of which ap- 
parently grips him so strongly that he must be awakened. Subject 4, a female, who 
spoke of the depicted scene as of a dream (R), in the standing position calls it a 
nightmare and inserts an element of coercion. In the sitting position she reverts to 
the idea of a phantasy or a dream. 

The elements dominating the responses elicited in the reclining position are 
those of daydreaming and thoughts playing with several possibilities. The ideas are 
less frequently projected upon the plane of reality but remain in the sphere of phant- 
asy. Thus, Subject 2, R: the boy is thinking... ; ST: a fellow was hunting. Subject 
3, R: a youth’s thoughts ... ; ST: a boy wants to go hunting. Subject 4, R: the boy 
is thinking of . . . ; ST: someone has been shot. Subject 5, R: scene is like an old 
time movie... ; ST: they cut a man open. 

The thoughts in the reclining position appear to be of a milder, sometimes re- 
signed nature. There is also an indication that the thoughts in the reclining position 
have a greater width and breadth; there is a farther-reaching groping, a more fre- 
quent change from one interpretation to another, One could say they are more what 
is popularly called philosophical, whereas in the standing position they are more 
goal-directed. 

There are two conspicuous apperceptive distortions in this series. Subject 3 
interprets the operation as an abortion performed on a girl, and Subject 4 reads a 
childbirth into the picture. Subject 3 witnessed, before the experiment with him 
Was started, one of the trials described later, in which the subject considered an 
abortion as a way out of his “trouble.” Since amnesia was not induced for any ¢x- 
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periences preceding hypnosis, associations apparently carried over into his inter- 
pretation. Subject 4 was a female for whom in the passive mood of the reclining 
position, a delivery had possibly stronger apperceptive value than the shooting of 
which she spoke in the standing position. Since both of these gross misinterpretations 
occurred in the reclining position, their very incorrectness and far-fetchedness sup- 
ports the assumption that this position facilitates the roaming of thoughts and 
associations. 

‘The responses given in the sitting position seem to occupy a medial position be- 
tween those made in the other two positions. There are sometimes repetitions of 
responses expressed in R and ST. It appears as if in the sitting position the tendency 
toward unreality and the set toward action check each other to a certain extent. 
There is more detail than in the standing position and these details are more care- 
fully observed, but they are also better adapted to reality, though considerably more 
constricted than in R, and more reasonable and sober than in either R or ST. 

Resvuuts Ustinc Lerrers 

In the second part of the experiment each of six subjects was given a letter to 
read. The five unmarried men (Subjects 7, 8, 9, 10, 11) received the same letter. It 
Was supposedly written by the subject’s girl friend informing the recipient of her 
pregnancy and asking him to decide whether she should have the child or not. The 
first solution implied his marrying her, the second the outlay of money for an ab- 
ortion. ‘The sequence of positions was planned in the same manner as in the first part 
of the experiment. It was not possible in all cases, however, to get responses in the 
sitting position, since under the emotional impact of the message some of the six 
subjects rose, walked around, or remained standing. 

Subject 7, male, single. age 20; order of positions S-ST, R. 

R: That’s unpleasant news. Escape. Where? When I am drafted she needs an 
abortion. I can’t send her money. But I'll help her out. I could offer my assist- 
ance. Seclusion. (Q.: What do you mean?) She needs a secluded place to have 
the child. 

I don’t understand. (Rises, walks to the window, reads the letter over.) 
: (continued from sitting position) I'll call her up. I'll tell her to have the child. 
She mustn’t have an abortion. That would be taking a human life. (Q.: 
What are you going to tell her?) To have the child. (Q.: Unmarried?) It’s not 
my child. I don’t know whose child it is. I’m sorry. To hell with society. Vl 
help her as best I can. 
Subject 8, male, single, age 23; order of positions: S-ST, R. 

R: (Long silence after reading the letter. Turns the letter around, front, back.) That’s 
silly. (Evamines the letter once more.) That doesn’t concern me. It’s a New 
York telephone number. She doesn’t live here. It’s a gag. It’s a joke. It 
has to be a different name. A gag. I know only one Myrna. Another name 
probably ° 
I don’t want to marry. I’m not ready. ’m not in love with her. (Reads the 
letter again.) That’s ridiculous. That’s another party. (/ises. Now standing.) 

*: I know a girl by that name but I had no relations with her. It can’t be. I just 

saw her two nights ago. Ridiculous. (Q.: What do you intend to do?) Nothing. 
I don’t want to get involved in this. It’s not my child. If she Says SO, let her 
prove it. She has to be the one to prove it. 


Subject 9, male, single, age 22; order of positions: R, ; 

R: The letter doesn’t belong to me. It belongs to someone else. I haven’t done 
such a thing recently. (/xamines signature.) That’s not Mac, maybe it’s 
Marey. But L saw her only once or twice. She is crazy. We were only petting 
with each other. She seemed to like me. Now to send me such a letter. That’s 
a rotten thing to do; probably she sent it to see me again. 
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I can’t imagine her writing me a letter like this. She didn’t want to marry 
me and I can’t afford to get married. (Examines signature.) What’s that? 
May or Mary? I don’t know anyone by that name. 

ST: Who is the letter from? May? Mary? Maybe her name was May. But I don’t 
understand. We had an argument. She said she was jealous. I don’t want to 
marry her. She’s just trying to get even with me. I’m going to talk to her 
mother. 

Subject 10, male, engaged, age 20; order of positions: 8, R, ST. 

R: She needs an abortion? Six months later? No, I can’t let her go through that. 
The doctors are all quacks. Vl marry her. She was always passionate. Vll 
tell her to come home and we'll be married. I'll support her. We’ll manage 
somehow. 

S: I don’t know who it’s from. I can’t read the signature. (Since no positive 
suggestions are made, he simply shakes his head and remains unresponsive.) 

ST: You didn’t get this letter from June. She’s true. I never had relations with 
June. But if it is true, she has ruined our lives. I must give up my studies. 
What shall I do? Become a truck driver. Something like that. I must sup- 
port June and her child. I could work in the evening, but that won’t support 
three people. She’s ruined everything. I must marry her. I’m going to call 
her. There’s no other way out. I wouldn’t want anything to happen to her. 


rh 


Subject 11, male, single, age 23: order of positions: R, S-ST. 
R: Oh, dear. Why does she need me? There’s nothing wrong with her having : 
child. No use to stammer one’s way out. I would like the child. But she’s 
married. Do you have a telephone? Ill tell her not to worry. Everything 

is all right. 


8S: A child. I don’t know her well. I don’t love her. Maybe I could marry her for 
convenience’s sake. Nobody has to know about it. It could be a premature 
baby. (Gets up.) 

: (continued from sitting position) How did I get into this? I don’t love her. An 
abortion costs a lot of money. A fellow likes to have children. Yeah, I could 
marry her. It’s the only thing to do. I’m going up there now. Writing a letter 
would take too much time. 


‘ 


S1 


Subject 12, age 26, received a different letter, since he was married. The letter 
was presented as having come from an anonymous friend. It informed the husband 
about the wife’s infidelity. The subject was started in the sitting position, but after 
receiving the message, he became extremely excited and paced the floor. When, after 
a long period of silence, he was asked what was the matter, he replied, “I don’t want 
to talk. I’ve got to think it over.” (Upon being asked to show the letter to the operator, 
he complied. When the operator read the letter and repeated its last question, ‘‘ What are 
you going to do about it?” he began speaking:) 

ST: I don’t believe it. She travels a lot. She gets gifts from men when she travels. 

Men have tried but she always told me about it. (Reads the letter again.) 
That’s a lot of baloney, It’s bunk. (Tears the letter up.) My wife wouldn’t 
do such a thing. That’s a joke, (very indignant) a bad joke. I won’t let it 
bother me. I’m positive a kid is playing a prank. I’m not going to worry about 
it any more. I'll go home. 
(Receives an identical letter, reads it, turns it around.) I don’t believe it. That’s 
bunk. There’s no sense to it. How did I get that letter? It’s anonymous. It 
doesn’t say it’s for me. Maybe it isn’t for me. She was in school this after- 
noon. I spoke to her this afternon. She’s pretty. Men would be after her. 
But she is true. (Scrutinizes the letter and smiles while reading aloud.) “If I 
were you I would watch my wife.’”’ Why should I watch his wife?) Who wrote 
that letter? I’m going to show it to Grace (his wife). Wonder who sent it to me 
me. It’s all very funny. 
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Two new elements were introduced in this part of the experiment: (a) the sub- 
jects were personally involved in the problem presented, and (b) a decision had to 
be made. 

In all cases emotions are more vigorously expressed in the standing position than 
in the reclining position. The least difference is noticeable in the case of Subject 7. 
But even he has a short outburst in position ST: ‘To hell with society’’, whereas he 
comments quietly in position R:*tThat’s unpleasant news.” Though unexcited, he 
defends himself (ST) against the inference that the child could be his. Subject 8 de- 
clines any involvement rather resolutely in position ST, whereas he remains incred- 
ulous in position R and waves the whole affair away as a joke or a mistake. Subject 
9, although rejecting any responsibility, becomes aggressive in position ST and 
accuses the girl of attempting to get even with him. Subject 10, aware of his responsi- 
bility for the girl and unwavering in his love for her, has an outburst of bitterness in 
position ST as he considers the threat to his future, whereas in position R he merely 
thinks of the girl’s welfare and resignedly accepts the inevitable. The difference in 
reactions is dramatically evident in Subject 12. Excitement and indignation are 
strongly expressed in position ST and the process of warding off the lingering sus- 
picion ends in the violent gesture of tearing up the letter. In position R he at no time 
really suspects his wife. The whole thing is a joke and he treats it as such. While in 
the standing position he is indignant when he calls it a joke, he smiles at the awk- 
ward attempt in position R and amuses himself with the phrasing of the letter and 
intends to show the note to his wife. 

Subject 8 did not consider a decision necessary. Subjects 9 and 12 come to a 
decision only in the standing position. But wherever decisions are made in more than 
one position, they are more forceful in position ST. Thus, Subject 10 wants to call 
the girl immediately (ST), but in position R he leaves the idea of communicating 
with her dangling, saying merely that he is going to tell her. Subject 11, who when 
reclining makes up his mind to call his girl and tell her not to worry (since she is 
married), decides (ST) to call her and tell her that he is going to marry her. 

In contrast, there is more careful weighing and pondering in the reclining 
position. The letter is scrutinized, date and signature are checked, actions that in no 
case occurred in position ST. Subject 11 forgets entirely his argument (advanced in 
position R) that the woman is married, and decides to marry her although he does 
not love her. And, finally, just as in the interpretations of the TAT card, the hap- 
pening is perceived on another plane; it never has the same reality that it has in the 
standing position. 


SUMMARY 


The hypothesis is formulated that body position exerts considerable influence 
on apperception, thinking, and emotions. The reclining position has a tendency to 
remove these processes from reality considerations and consequently allows greater 
complacehcy. It favors expansion of suggestions and associations, but inhibits the 
tendency toward action. More room is given to the exploration of both relevant and 
irrelevant factors. In the standing position the energies are stimulated toward action 
and emotions express themselves more forcefully. At the same time the field tends 
to be narrowed and the influx of new suggestions is partly blocked. Situations are not 
scrutinized as carefully and surrounding circumstances and possibilities are not 
sufficiently examined. Decisions come faster and are more vigorous. The sitting 
position seems to favor an equalization between these antagonistic tendencies. It is, 
however, closer to the standing position than to the reclining position. There is a 
more factual account of perceived material, but the greater richness of this material 
suggests that some of the inhibitions of the ST position are partly relaxed. In an emo- 
tional situation there is a tendency to motor activity, to get up, whereas there is no 
attempt even to sit up from the reclining position. Apperception becomes selective 
in accordance with the basic tendencies of these positions. 
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Though the limited scope of this experiment does not permit us to draw definite 
conclusions, the findings suggest research possibilities by which information that we 
still lack about the thinking and apperceptive processes could be obtained. If the 
basic hypothesis should be confirmed, it would be interesting to learn whether the 
position of the brain is a determining factor or whether it is changes in the supply of 
blood to the brain or the muscular set of the whole body that influences the brain’s 
activity with regard to reasoning, decision, and imagination. It may also be interest- 
ing to note that certain suggestions arose only in certain positions. Suicide, for in- 
stance, was mentioned by two subjects in the reclining position, but in no case in the 
standing position. If the reclining, sitting, and standing positions stir to expression 
contrasting facets of the personality, there is a possibility that responses to pro- 
jective tests such as the Rorschach or the TAT may give different results if admin- 
istered in different positions. 


ING PSYCHOTHERAPEUTIC PROTOCOLS WITH CONTEXT 
COEFFICIENTS 
JOHN M. BUTLER 


University of Chicago 


In recent years several interesting attempts have been made to assess the 
change of individuals over counseling or psychotherapy by way of constructing 
categories of behavior to be studied over time. The nature of the categories used has 
depended upon the theoretical orientation of the worker involved; thus, Dollard 
and Mowrer“? have derived the Discomfort-Relief Quotient, a measure dependent 
upon classes of verbal behavior considered to be indicative of ‘‘tension” and ‘“ten- 
sion-reduction”’. Raimy and Kaufman? have used a quotient based on the use of 
favorable, unfavorable, and ambivalent self-references, categories derived from 
Raimy’s work in the realm of self-theory. These examples are typical: each worker 
selects some category of behavior to be studied; counts the frequency of appearance 
of sub-classes within the category; transforms the frequencies to proportions or per- 
centages; and draws inferences with regard to his hypotheses by inspecting the 
slopes of his curves“?. If more than one class of behavior is under consideration, the 
correspondence or latk of it between the curves of the several quotients is ascertained 
and further inferences may be made about the therapeutic process or therapeutic 
progress of the client or patient. 

The use of quotients of the form < A 3 has been necessitated by practical 
considerations such as the varying length of therapeutic interviews, the varying 
length of psychotherapy, variations between clients and other essentially irrelevant 
factors which affect the production of the behavior being studied in such a way that 
comparisons of frequencies over interviews and over clients would be highly mis- 
leading. The use of quotients of the form < “ jy has, however, at least two implica- 
tions which generally have not been explicitly recognized. In the first place the quo- 
tient immediately suggests its complement, a py 80 that > “ s + xa = las 
a matter of mathematical necessity. Each quotient not only implies the other but 
specifies it. Thus if <o ; is the Discomfort-Relief Quotient, then ra ph is the 
Relief-Discomfort — Secondly, the fact that the quotients sum to unity sug- 
gests that the categories A and B are exhaustive or that they define a domain in 
which other categories such as C, D, E, ete., are irrelevant. In a logical sense this is, 
of course, true, but in a psychological sense it is indeed questionable. It is a com- 
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monplace among psychologists that the significance of a given behavior derives at 
least partly from the context of the behavior in which it occurs or that antecedent 
responses influence his subsequent responses. 

Viewed in this light, the right hand member of the above equation implies 
either that all of the behavior of the individual is A and B behavior or that A and B 
behaviors are given weights of 7 and that all other behavior is given a weight of zero. 
The former implication is nearly always untrue in fact and the assignment of weights 
of zero seems, from a clinical standpoint at least, to be somewhat arbitrary. One 
might argue that it would be better to use quotients in which the total number of 
of responses appears in the denominator. Such quotients would have the form 
— =< @ Where O denotes the “other” or irrelevant responses. However, such 
coefficients have a serious deficiency. Coefficients of the form , =: _ Yield informa- 
tion on what might be called the balance of A and B; they do not show the influence 
of context. Coefficients of the form 3-3 . = reflect the influence of context and 
confound the information with respect to balance. This implies that both forms 
should be used and certainly this is an alternative to using but one form. It seems, 
however, desirable to use a coefficient which would show the influence of context 
upon the balance coefficients. 

It seems worthwhile to consider in some detail the influence of context upon the 
interpretation of quotients of the form > > y: Lake, for example, the Discomfort- 
Relief Quotient, DRQ, in which D refers to discomfort responses and R refers to 


eel i I a a Fr Sa 
relief responses. If 5-3 = -5, then p= = -5. The D and R behaviors ar 


then considered to balance each other since they are equal in number. If the DRQ is 
smaller than the RDQ, the balance is in favor of the relief responses, and if the DRQ 
is greater than the RDQ, the discomfort responses outweigh the relief responses. 
Presumably, then, successively equal values of the DRQ over psychotherapy would 
lead to the inference that the tension-relief balance of the chent had not changed 
throughout therapy. Is this a necessary inference, granting the theoretical validity 
of the DRQ? The answer can only be no. Assume that there are 100 client responses 
in each interview and that there are 50 interviews. In the first interview there is one 
each of D and R responses. In the second interview there are two each of D and R 
responses and so on until the last interview is reached in which there are 50 each of 
1) and R responses. We are not likely, except in other armchairs, to.get such well 
ordered data in which the DRQ is precisely .5 for all interviews. Nevertheless, the 
example suffices to show that while the balance is numerically equal throughout, 
there are changes over therapy. From such data one might infer that the client was 
progressively makingr esponses intimately related to his problems, and that he be- 
came less and less defensive as time went on, if it is held that tension responses gen- 
erally relate to maladjustment and relief responses relate to adjustment. 

We might find that the DRQ becomes successively smaller over therapy which 
might be taken to indicate that the tension level has decreased as a result of psycho- 
therapy. In the first interview the DRQ might be, say, .9 with D = 9, R = 1 and 
r .90. Suppose that the DRQ gets successively smaller with a final value of .375 
with D = 30, R = 50, O = 20. In the final interview there are more than three 
times as many D responses as in the first and the R responses have increased fifty- 
fold. The inference that the tension level has decreased seems to the writer to be 
open to question. In fact he would be tempted to infer that the tension level had 
increased. The same doubts could arise if the successive values of the DRQ increase 
throughout therapy. Presumably such a result would be interpreted as lack of 
progress. Yet if the O responses decreased successively, one might conclude that the 
individual worked at an increasingly emotional level but that he terminated therapy 
before the emotional difficulties had been alleviated. 

The foregoing examples illustrate the influence that context may have upon 
the interpretation of balance coefficients and lead to the conclusion that the basic 
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difficulty involved in interpreting balance coefficients is occasioned by the fact that 
they are invariant with respect to context. Of course, this invariance may be a de- 
sirable feature depending upon the interest of the investigator but it seems imposs- 
ible not to conclude that the contextual influences must be explicitly considered in 
the interpretation of balance coefficients of the form ce y When they are applied 
to therapeutic protocols. 

The influence of context upon balance can be ascertained if we set the condition 


. A B . 
that the balance coefficients, ,—g and ;~~,j sum to unity only when the A and B 


responses are the only responses. That is, all responses given are either A or B. Then 
the equation * + . = 1.00 where T = A + B® indicates both the balance of 
A and B and the fact that all of the responses in the interview were A and B and 
nothing else; otherwise stated, the A and B responses were emitted in a context of 
only A and B responses. Of course, the balance of A and B is shown <s but this 
is so only because the members of (1) sum to unity. When A + B = T, the incidence 
of other types of responses, O, is greater than zero. In such a case the left hand mem- 
bers of (1) do not satisfy the restriction that they sum to unity only when O = 0. 
The problem is to so modify the left hand members of (1) that the condition is 
met. This cannot be done by using aie and <> ao since these co- 
efficients sum to less than unity. However, the condition can be met by weighting 
ach of the balance coefficients by A/T and B/T respectively (T = A + B + O) 
and then taking the square roots of the products. That is, the left hand members of 
(1) are modified to read as follows: 


A A B B , 
\ j s “ (3 ) and \ (, + " ( zy (1) 
These coefficients will be called context coefficients since they show the influence of 
context (or of O responses) upon balance coefficients. Since the context coefficients 
contain the balance coefficients, and since the context coefficients are identical with 
the balance coefficients when O responses are not present, we can regard balance co- 
efficients as limiting values of the context coefficients which are approached as the 
number of O responses approaches zero. 
The context coefficients sum to less than unity when the number of O responses 
exceeds zero. Therefore, the equation 


aes he oe 


holds only when O responses are absent. When A = B, O > 0, then, as with balance 
coefficients, the context coefficients will be equal, but they will be smaller than the 
corresponding balance coefficients which will each have the value, .5. How much 
smaller they will be depends upon the total number of responses. For example if 
A = 1,B = 1,0 = 998, T = 1000, then 


B 


~) = 1.00 (2) 


y (5) (001) + ay (5) (001) = .022 + .022 = .044. (3) 


As is the case with the balance coefficients, the context coefficients are equal. 
However, the low values of the context coefficients indicate that the equality of 
balance of A and of B responses occurs within a preponderantly large context of 
other (or O) behaviors. The context coefficients indicate simultaneously both the 
balance of the behaviors being studied and the effect of the context upon the balance. 


The properties of context coefficients may be summarized as follows: 


1. They sum to a maximum of unity when all of the responses are A and B 
esponses and are then equal to the balance coefficients. 
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2. They are equal in magnitude to each other when A = B, O > 0. 
3. They decrease as the proportion of O responses decreases relative to A and B 
responses. 


4. They increase when the proportion of A and B responses increases relative 
to O responses. 

5. Ifa given balance coefficient is greater than its corresponding balance co- 
as : A as Se alat? ea eon ee : 
efficient (if —3-—3 < q—¢7), then the same relation holds for the corresponding 
context coefficients: 


Since comparison of the two context coefficients yields the same information as 
a balance coefficient and since the context equation (2) explicitly shows the influence 
of context upon balance, it is considered that in the study of psychotherapeutic 
protocols the use of context coefficients is preferable to the use of balance coefficients 
of the form, x B 
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A NOTE ON THE DIFFERENTIATION OF NORMAL AND NEUROTIC 
CHILDREN BY MEANS OF OBJECTIVE TESTS* 
H. J. EYSENCK AND D. B. PRELL 


Psychological Department, Institute of Psychiatry (Maudsley Hospital), London. 


PROBLEM 

In a previous paper? we have shown that a sample of ‘normal’? monozygotic 
twins were appreciably more alike with respect to a measure of ‘‘neuroticism”’ based 
on a battery of objective tests than were a sample of ‘‘norma]”’ dizygotic twins, and 
the conclusion was drawn that the quality underlying these objective tests was to a 
considerable extent inherited. On the basis of certain somewhat doubtful assump- 
tions 7, it was calculated that the influence of heredity on the factor score amounted 
to 817, although no great value was placed on the exact figure. In any case, there 
appeared to be little doubt that heredity played about as large a part in the genesis 
of neuroticism as in that of intelligence. A discussion of an operational definition of 
neuroticism will be found elsewhere ®: *. 

An attempt was made to obtain an outside verification of our identification of 
the factor isolated from the intercorrelations of the tests as one of ‘‘neuroticism”’ by 
showing that tests having high correlations with this factor also discriminated well 
between our whole group of 100 twins (who might be regarded as a “‘normal”’, i.e. 
unselected section of the total population) and a control group of 21 neurotic child- 
ren (i.e. children definitely diagnosed as suffering from neurotic disorders by senior 
staff members of the Maudsley Child Guidance Clinic.) A correlation of .76 was 
found between the factor loadings of the 17 tests used, and their biserial correlation 
with the normal-neurotic dichotomy. As this proof is somewhat indirect and depend- 
ent on complex statistical argument, and as it does not enable us to state the actual 
amount of discrimination achieved by the test battery in question, a different method 
of proof is attempted here to show that the statistical factor isolated can indeed with 
some show of reason be labelled “‘neuroticism’’. 


*We are indebted to the Eugenics Society for a grant which made this study possible. 





A NOTE ON THE DIFFERENTIATION OF NORMAL AND NEUROTIC CHILDREN 203 


METHOD AND RESULTS 

From our battery of tests we obtain a single, weighted score! which, if the qual- 
ity measured by the tests can really be called ‘“‘neuroticism’’, should discriminate at a 
very high level of statistical significance between our “normal” and our neurotic 
children. In addition, we should be able to formulate certain other hypotheses cap- 
able of being tested in terms of the figures obtained. Thus for instance the neurotic 
group was very highly selected, and should therefore be rather homogeneous with 
respect to neuroticism; the “normal” group was completely unselected, and should 
therefore be relatively heterogeneous. We would therefore expect the ‘“normal”’ 
group to show a greg iter spread of scores (higher variance) on the measure of neurot- 
icism, Again, in view of the strict selection of the neurotic children, we should expect 
all of them to have definitely neurotic scores; the “normal’’, unselected group, on the 
other hand, would be expected to have between 5°, and 10°; of severely neurotic 
children in it (cf. R. Fraser and Burt for data supporting this statement). Con- 
sequently, a small proportion of the so-called ‘“‘normal’”’ group should in fact have 
neurotic scores. 

With these hypotheses in view, factor scores were calculated for the 21 neurotic 
and the 100 ‘“‘normal’” children. A diagram presenting the main results is given 
below (Fig. 1.). The portion of the diagram representing the “normal” group was 
reduced in size in the ratio 21/100 in order to equate the total area of the two dis- 
tributions for easier comp: ison. The mean score of the neurotic group was 13.24; 
that of the ‘normal’ group was 23.08. The significance of this difference was tested 
by means of a very conservative method (using means of pairs of twins as scores, in 


Fig. 1. COMPARISON OF DISTRIBUTION OF SCORES FOR 21 NeEuROtTIC CHILDREN AND A CONTROL 
Group oF 100 NorMAt CHILDREN. 
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order to avoid the pitfalls inherent in the use of separate children’s scores, which 
assumes a complete lack of correlation between the children) and was found to be 
well beyond the p=.001 level. There remains, therefore, no reasonable doubt that 
“neuroticism” scores differentiate with considerable accuracy between neurotic and 
“normal” children. (It may be noted that the mean scores of the monozygotic twins 
and the dizygotic twins were almost identical—23.20 and 22.96 respectively, and that 
each of these scores separately is very significantly different from that of the neurotic 
group. 

It will be evident from Fig. 1 that our expectation with regard to the greater 
spread of scores of the ‘‘normal’’ children is also borne out. The variance for the 
“normal” group is several times that for the neurotic group, a difference significant 
beyond any reasonable doubt. Again the values for pairs of dizygotic twins (50.83) 
and pairs of monozygotic twins (172.67) taken separately are significantly greater 
than the value for the neurotic children (12.60). (The difference in variance between 
pairs of dizygotic and monozygotic twins was already noted in our previous paper; 
it is almost entirely due to the fact that we are considering scores for pairs of twins, 
and that the correlation between pairs of twins is much higher for monozygotic (.85) 
than for dizygotic twins (.22). When we take scores for individual children, the var- 
iances become 91.55 for monozygotic and 41.47 for dizygotic twins.) 

It will also be seen from Fig. 1 that while none of the neurotic children have 
scores as high as the mean of the ‘normal’ group, about 10°; of the “normal” child- 
ren have scores as low as the mean of the neurotic group. This is of course in part a 
consequence of the difference in variance between the two groups, but it does seem 
to indicate the correctness of our hypothesis that our process of selection would en- 
sure that all our neurotie children were really neurotic, but could not ensure the 
absence of neurotic children in the ‘normal’ group. If psychiatric screening had 
taken place with respect to the “normal” group, it seems certain that our neuroticism 
score would have shown much higher discriminatory powers. Such screening would 


of course have vitiated the main aim of our experiment, i.e. the study of the degree 
of inheritance of neuroticism, by restricting the range of the phenomenon under con- 
sideration; the point is merely mentioned because it is not always kept in mind in 
assessing the usefulness of objective tests in discriminating groups of neurotics and 
unscreened ‘‘normals’’. 


CONCLUSIONS 

It may be concluded that the discrimination achieved appears very promising, 
particularly in view of the doubtful status of some of the ‘‘normal”’ children. It 
should also be borne in mind that the battery of tests used represents the best that 
could be done at the time when this experiment was begun, i.e. in 1948; much better 
batteries of tests are now available through the work of Himmelweit and Petrie “®? 
and others who have improved the older tests, or suggested new ones, and it may be 
confidently anticipated that research now in progress will show considerably better 
differentiation than that found in this study. However, even at its present level, ob- 
jective tests of neuroticism might be of considerable practical use as screening tests, 
i.e. in isolating the child most in need of psychiatric attention. Taking all these facts 
into account, it would appear that our identification of our factor with ‘‘neuroti- 
cism”’ finds considerable justification in the results here reported. 
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POST-HYPNOTIC SUGGESTION IN GROUP THERAPY: A NOTE! 
WILLIAM B, SINGER 


Veterans Administration Hospital, Vancouver, Washington 


In Veterans Administration general, medical and surgical hospitals which have 
a neuropsychiatric service, the admission rate is high, and the average period of 
hospitalization is limited by consideration of bed space. In this setting there is a 
distinct need for abbreviated types of psychotherapy. At this hospital this need is 
met to some extent by an intensive program in which individual therapy, group 
therapy (discussion) and psychodrama are integrated so that each supplements the 
other. An additional technique has been developed which involves the use of hyp- 
notherapy with the two forms of group therapy. 

Hypnotherapy has been found useful at this hospital for expediting the penetra- 
tion of resistance, the elicitation of significant repressed material and the integration 
of this dynamic material with the conscious mind. Although the efficacy of direct 
post-hypnotic suggestions directed toward the development of salutary changes in 
the patient is very transitory, it is nevertheless possible to exploit this short-term 
effectiveness to prepare patients for maximal parti¢ipation in group forms of therapy. 
A number of the patients having hypnotherapy also participate in the group therapy 
and psychodrama programs. The following procedure is used with the patients of 
this group who are capable of attaining deep or somnambulistic trance depth and 
can reliably execute post-hypnotic suggestions. 

When certain problems which are valuable and feasible for group discussion 
emerge under hypnosis, the patient is given a post-hypnotic suggestion that he will 
discuss this problem in the next group therapy session. He is instructed that he will 
become acutely aware of this problem at the next group meeting, will discuss it 
freely and will express all of the significant feelings associated with it. He will be 
alert to and ready to accept any feelings and insights generated in the group dis- 
cussion. When the patient subsequently discusses these problems in the group sess- 
ion, he receives the therapeutic benefits which are intrinsic to the group situation, 
such as reality testing, group acceptance, etc. In addition, his high level of motiva- 
tion and emotional participation acts as a ferment or catalyst to elicit a high degree 
of emotional participation in the members of the group. 

The same technique has been applied to prepare patients for maximal partici- 
pation in psychodrama. It is suggested to the patient that before the next psycho- 
drama meeting, he will become very interested in dramatizing a certain problem and 
will develop an appropriate plot. He will react to the other participants as if he is in 
a lifelike situation, will throw himself into his role with intense emotion, and will 
be keenly receptive to significant feelings and insights occurring during the per- 
formance. 


‘Reviewed in the Veterans Administration and published with the approval of the Chief Medical 
Director. The statements and conclusions published by the author are the result of his own study and 
do not necessarily reflect the opinion or policy of the Veterans Administration. 








A NOTE ON PROFESSOR R. B. CATTELL’S METHODOLOGICAL 
ADUMBRATIONS 


WILLIAM STEPHENSON 


University of Chicago 


In this journal“? Cattell has outlined a set of techniques of factor analysis, O, P, 
Q, R, 8, T, .. . ete., which is meant to be more comprehensive than our first formu- 
lations along such lines in 1936. It should be pointed out, however, that our own 
had to do with multivariate analysis, a statistical matter, and not with the innumer- 
able applications of the two major systems R and Q we originally proposed. The 
various techniques of Cattell are applications of the original multivariate for- 
mulations, and these can be legion in almost every branch of psychology. It is 
very simple to demonstrate this by noting that we can propose two further schemes, 
along Cattell’s lines, superordinate to all his, one called H, and the other D. In the 
former everyone in Cattell’s designs could be in a state of hypnosis, all the measure- 
ments being repeated for this condition. In the other all the subjects could be in a 
state of intoxication (D), or confusion, or both. 

Moreover, Cattell is in error about the relationship between Q and R techniques. 
These refer to different matrices of data with quite different postulates, involving 
quite distinct and different methodologies, to which Burt’s reciprocity principle ©? 
can in no way apply. Burt himself is now more careful about this matter, if we are 
to judge by recent observations ©, 

The area of methodology into which Cattell has moved is a difficult one, requir- 
ing much more than naive Millian conceptions of scientific method for its elucida- 
tion. It happens that Q-technique brings almost all that has been regarded hitherto 
as ‘‘subjective’”’, that is, man’s reflections, musings, retrospections, dreams, and the 
like, his self-notions, and every manner of verbal report, into the domain of singular 
testable propositions, and that theories that were hitherto regarded as untestable 
and unscientific, can now be reduced likewise to testable propositional form with 
Fisherian methods and factor analysis as the analytical tools. Even a test such as the 

torschach, long a center of criticism because of its supposedly unscientific structure, 

can now be shown, along lines of Q-methodology, to be a brilliant foresight by Ror- 
schach of a method of experimenting, called the postulatory-dependency method, a 
collateral of the hypothetico-deductive method, about which much will be heard in 
future. This, as in all Q-technique studies, begins with a postulatory set, but whereas 
the nomotheticist wishes to test these postulates (and to do so assumes that the 
most impossible of all propositions are at issue, namely universal, nomothetic 
ones), along Q-technique lines quite different matters are tested, namely proposi- 
tions concerning a theory. 

That these matters are not generally or widely known is because the fruits of 
Fisherian methods“, of certain conclusions of logical philosophy, notably of Witt- 
genstein, Schlick, Carnap, and others, and the contributions of Kantor to logic are 
not as well known to psychologists as they should be. 

Meanwhile, we would like to state categorically that there exist only two factor- 
systems in the domain of multivariate analysis, one called R, and one Q, and that these 
cannot be reduced to a single system. Roughly the former deals with nomothetic 
postulates and with interdependency forms of multivariate analysis, and the latter 
deals with idiographic postulates and dependency forms of multivariate analysis. 
The two are under no circumstances merely two ways of looking at the same facts. 
Since we can claim prior use of the designation Q, dating back to 1935), it seems 
somewhat ironical for Cattell to borrow the term to perpetuate the very misconcep- 
tions against which Q was directed in the first place. We resist any such peculiar 
displacements, and would ask for the letter Q to stand for the system adumbrated 
at the outset. Moreover, for Cattell to suppose that Q-technique is concerned with 
“subjective” matters only is another of the profound mistakes with which this sub- 
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ject seems fated. We long ago gave examples of its application to the study, for 
example, of performance. Here again, however, if we take a leaf from Brunswik’s? 
essays it is not difficult to show that there are siz representative probing-points into 
behavior—those concerning (i) the ‘internal’, (ii) the ‘external’ and (iii) the 
“historical”, frames of reference, each from (a) the subject’s own, or (b) the ob- 
server’s vantage points. American Self-psychology deals with (i) from the stand- 
point (b) (to judge by Snygg and Combs“). Classical Behaviorism, and almost all 
its derivatives, deals with (ii) from standpoint (b). Psychoanalysis is largely con- 
cerned with (iii). It happens that alone of any techniques known to us, Q-technique 
can be used to probe into behavior from all six of these representative possibilities. 
R-technique, and the various applications of the so-called “objective” procedures 
referred to by Cattell, concern b(ii) and b(iii) at most. So that what is regarded 
as limited and static by our critics, turns out to be of the widest possible scope, 
never even approximated heretofore in any of the systematic psychologies. 
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CONSTRUCTION OF A KEY TO DETERMINE RECIDIVISTS FROM 
NON-RECIDIVISTS USING THE MMPI 


ROBERT A, FREEMAN AND HARRY M. MASON 


Washington State Penitentiary 


Clark recently published a key for the MMPI intended to differentiate re- 
cidivists from non-recidivists. However, he did not report applications of the key 
to groups other than the one used in constructing it. The present note reports ap- 
plications of Clark’s key to MMPI responses of recidivists and first-offenders at the 
Washington State Penitentiary, and attempts to develop and validate further re- 
cidivist keys by item analysis methods. 

Since August, 1949, all literate, newly-admitted, male inmates to the prison 
have responded to the MMPI. Tests were administered after a 30 day orientation 
period. By a review of the records of inmates, those having one or more previous 
convictions of a felony were differentiated from those being admitted to a peni- 
tentiary for the first time. Clark’s key was applied to MMPI responses of 60 recidi- 
vists and 40 first-offenders, chosen as consecutive admittees, fulfilling these defini- 
tions. Line 1 in Table 1 shows the results. Clearly, the key shows no differentiation. 
Line 2 in Table 1 shows separation of criterion groups of recidivists and first-offend- 
ers used in constructing a key by item-analysis procedures from WSP records. All 
items showed. tetrachoric r’s of .40 or greater with recidivism. This yielded 14 items, 
only one of which (item 234, Group form) appeared in Clark’s key. Line 3 in Table 1 
demonstrates shrinkage to zero validity when this key is applied to new groups 
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TABLE 1. 


Ist 


Criterion and Group Reeidivists Admissions 


Clark’s Recidivism key applied to WSP in 10.1 0 
uites’ records Eee 6.3 6 
. 60 


14 item recidivist scale developed by item- 
fil alysis tetrachorie on WSP inmates 


3. 14 item recidivist scale developed in (2) ap- Mean 
plied to next consecutive records. Ss. D. 
N 
4. Reeidivist scale developed by item-analysis Mean 
(26% or more difference) 43 items. S. D. 
N 
5. Seale developed in (4) applied to next con- Mean 
secutive cases. s. D. 5.0 


N 10 


similarly drawn. A final attempt at key construction, shown on Line 4 in Table 1, 
employed 100 recidivists and 100 first-admittees. A difference of 138°; or more in 
frequency of true response between the criterion groups characterized each item 
selected. The final line in Table 1 demonstrates the collapse of this key’s validity 
when applied to the next-consecutive records. 

Cureton ® has warned against assuming the validity of a test or key until its 
validity has been demonstrated on subjects other than those from which the measure 
was derived. The present note indicates the soundness of this advice. The possibility 
of constructing a recidivist key for use with M.M.P.I. seems at this point to rest with 
analysis of records of recidivists and parolees who have, after ample time, demon- 
strated law-abiding behavior. Such a procedure might serve to provide a group of 
non-recidivists free from contamination by individuals who have not become re- 
peaters only through want of opportunity. Records now being accumulated at the 
Washington State Penitentiary may form a basis for such an attempt at a later date. 
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A SIMPLE QUANTITATIVE MEASURE OF PRESSURE FOR USE IN THE 
PROJECTIVE TECHNIQUES 
CLAUDINE GIBSON WIRTHS 


University of North Carolina 


This is a brief note on the use of a measuring technique and some implications 
of the scores obtained by the use of it for quantifying the variable of pressure exerted 
by the subject in the various projective paper-and-pencil techniques. It is applica- 
ble to such tests as HTP, Bender Gestalt, group administered TAT and Rorschach, 
and it is particularly useful in the draw-a-person technique. It is in the context of 
this last application that the use of this measure is discussed in this note. 
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In essence, the method involves the multiple reproduction of the original draw- 
ing by means of carbon paper while the subject is drawing. The amount of pressure 
exerted by the subject will be reflected in the number and extent of reproductions 
that is made under standard conditions. Therefore, the number of the page on which 
the final impression is visible may be taken as a measure of (or a score which repre- 
sents) the amount of pressure exerted. 

To use this method of measuring it is of course necessary to use standard condi- 
tions for successive measurements. The most important conditions to be standard- 
ized are those of (1) weight and grade of paper, (2) carbon weight and finish, and (3) 
hardness of surface on which the drawing is made. The last point is crucial in regu- 
lating the number of carbon impressions which can be made with a given pressure. 
It is suggested that each new drawing be begun with a freshly sharpened No. 2 pen- 
cil. The various papers (usually 8 carbons are sufficient) should be stapled together 
or fastened in some way that will not hamper the individual’s ease of drawing. 

Two useful scores may be obtained by this method which enable the investiga- 
tor to make accurate comparisons between individuals such as: ‘‘individual A exert- 
ed twice as much pressure as individual B” or ‘individual A exerted uniform pressure 
while individual B exerted maximum pressure on, only 4 of the total lines drawn’, 
These two scores may be called the critical pressure score and the critical pressure 
ratio. 

The critical pressure score is the number of the page (numbering the original 
drawing 1 and counting down) held at normal reading distance on which appears 
the last-impression visible to the unaided 20/20 eye. Thus each person is given a 
score (2 or 4, ete.) which represents his maximum pressure. The individual who gets 
a score of 2 may be said to have exerted half as much pressure as one who made a 
score of 4, and a third as much pressure as one who made a score of 6. If we assume 
a one-to-one correlation between pressure and some psychological variable (or can 
discover the relationship such as a Weber fraction) the psychological variable can 
be similarly quantified. 

The critical pressure ratio gives a measure of amount of maximum pressure as 
related to original figure. The amount of line that shows through on the final crit- 
ical page may be measured by the use of a cumulative ruler and this figure related 
to the total line space drawn in the original figure. Perhaps more important than 
this ratio is the aspect of the figure that shows through on the critical page. It would 
seem that the individual who uses uniform pressure would differ considerably from 
the individual who draws a leg with such vigor that it shows through to page 8 while 
the bulk of the figure shows through only to page 6. It would then be correct to say 
that the drawing of a leg stimulated this individual to 33 1/3) greater pressure 
than did the drawing of any other aspect of the figure. 

If color is used, a similar score can be assigned to each color or to each portion 
of the figure where a color is used differently. Insufficient research has been done 
with this technique to. establish norms for any of these scores, but it would appear 
that this may also be a useful tool for those who wish to study the importance of 
color to the individual and its relation to other psychological parameters. 

Further research in developing norms for this third dimensional aspect of the 
draw-a-person technique is being carried out by the author. The results of this will 
be reported at a later date. 
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PSYCHOLOGICAL FIRST AID 


Strangely enough, psychological and mental hygiene literature has given 
relatively small attention to the first aid treatment of acute personal problems. In 
fact with the current emphasis on depth analysis, there is if anything a tendency to 
minimize presenting complaints on the grounds that when depth factors are properly 
dealt with then presenting symptoms will disappear. In our opinion, this preoccupa- 
tion with depth psychology has had a very detrimental effect in causing us to over- 
look presenting complaints which may be very distressing to the client and about 
which he urgently wishes us to do something. Without overemphasizing obvious 
analogies with medical first aid, it is immediately apparent that there are many acute 
situations in psychotherapy which truly constitute emergencies about which some- 
thing should be effectively done. Prophylactically, it is probable that many dis- 
orders could be nipped in the bud if prompt attention could be given to germinating 
seeds which may later grow into tall oaks. Diagnostically, one of our problems is to 
identify these emergency situations so that we can discriminate what needs to be 
done immediately. Therapeutically, much will be gained if the client can be made 
more comfortable even though no deep cure can be effected by first aid methods. 
This is an area of psychotherapy which is as yet relatively unexplored and which 
offers great promise. 

Because of the large variety of situations which could properly be designated 
as acute psychological emergencies, it follows that the therapist must be equipped 
with a large armamentarium of tools which can be used skillfully and eclectically 
according to the indications of each specific situation. First aid treatment by its 
very nature should be very flexible and expedient, utilizing every possible method of 
achieving results. Frequently, it will be necessary to experiment with many different 
devices in a blind shot-gun manner, hoping that something will hit the mark because 
there is no time to conduct the detailed investigations which would uncover the un- 
derlying dynamics which would indicate a more rational plan of therapy. The ther- 
apist must operate intuitively, sensitively gathering impressions about what is the 
matter, and then depending upon his skill and experience to do the right thing. Here 
is a situation which is so uniquely different that there are no established guideposts 
to follow mechanically. Only wisdom and clinical judgment can be depended upon 
to provide the proper solution. Because of the emergency nature of the situation, 
there is no time to embark upon ponderous methods or to follow rigid systems. 
Indeed, such attempts may only increase the client’s anxiety if they fail to deal 
with the things which are acutely disturbing him in the present. 

It appears that psychological first aid must utilize many of the older orthodox 
methods which are currently in disrepute with depth therapists because they operate 
primarily only upon symptomatic levels. Reassurance is probably the first aid method 
par excellance. Many clients are acutely in need of reassurance concerning their 
fears and problems. Suggestion can be used to deal with acute symptoms requiring 
immediate attention. Catharsis (or nondirective acceptance, reflection of feelings 
and clarification) may be mind-saving for people bursting over with acufe tensions. 
Persuasion, advice and other supportive methods may help the client to deal with 
acute situational problems which are beyond his resources. Unfortunately many 
therapists have never learned to use these methods effectively, while others seem to 
think it beneath their specialist’s dignity to deal with the ‘common colds’ and 
“‘bellyaches”’ of psychic life. However, insofar as any therapist fails to deal effective- 
ly with emergency problems, he is not functioning at the highest levels of professional 
competency in making the client comfortable and giving relief from anxiety and 
mental turmoil. 





EDITORIAL OPINION 211 


It would seem desirable to provide students and_ psychological interns with 
concrete instruction and practice in giving psychological! first aid. In the same 
manner as the medical student must learn to sew up cuts and to utilize lifesaving 
methods, so the clinical psychologist and counselor should learn how to deal with 
psye chological emergencies. The only way to accomplish this is to provide training 
in all known methods together w ith a thorough practical knowledge of the indica- 
tions and contraindications. The student who is not given this knowledge must 
inevitably develop feelings of professional inadequacy and anxiety because, he is not 
capable of doing all which is possible at time and place. One of the mogt serious 
criticisms of training centers which adhere exclusively to one ‘‘school’”’ is that they 
must plead guilty to turning out students who are not genuinely and eclectic ally 
well rounded in using all known methods. Such students may be very proficient 
in the methods in which they have specialized, but we know of several specific in 
stances in which they have later developed profound feelings of professional in- 
adequacy when faced with situations where they could not apply their only skills. 
The profession as a whole needs to give more detailed attention to the wide range of 
situations which its practitioners must ultimately face, and some of which are of a 
very acute nature. Training centers depending primarily on limited facilities, such 
as counseling centers for college students or vocational guidance, frequently touch 
only upon a small part of the range of problems which will actually be encountered 
in meeting actual social needs. Ideally, every clinician should be as adept at first aid 
measures as he is with depth therapy. 


F.C. T. 


REALISM IN CLINICAL SCIENCE 


Unfortunately, scientists are not always the most realistic of persons. There is 
some truth in the conception which pictures scientists as absentminded ‘“‘ivory- 
tower” people whose own lives are in a state of picturesque disorganization at the 
same moment when they are attempting to teach or influence others. This depiction 
is reinforced by the commonplace observation that scientific men are not particularly 
effective in public affairs. Indeed, the general public has learned to become wary of 
“intellectuals” who are too often unrealistic in the sense that they seem to be playing 
one string violins without proper appreciation of the global aspects of total situations. 

There is great need in the fields of clinical and applied psychological science for 
wise realism on the part of those who seek to influence social customs by scientific 
teachings. The American scene has been deeply influenced by three movements— 
Behaviorism, psychoanalysis and the child guidance movement— in directions 
which have not always been realistic and healthy. Indeed, the success or failure of 
psychological facilities and guidance clinics in all parts of the country has too fre- 
quently been determined by public acceptance or rejection of the ideological opin- 
ions expressed by the staff. In several instances known to the writer, the whole pro- 
gram of a clinic has been sabotaged by public rejection of viewpoints expressed by 
unrealistic personnel. Examples of the influence of unrealistic teachings may be 
cited with refe rence to each of the schools of thought cited above. 

Behaviorism’s most glaring inadequacies stem from its denial and failure to 
recognize the i importance of areas of behavior lying outside the realm of what can be 
dealt with by objective methods. Beginning with its denial of the realities of ‘‘mind”’, 
consciousness, volition (Will) and other forms of subjective experience, Behaviorism 
goes on to exploit the principle that all behavior is learned or conditioned, and that 
therefore mankind is literally the victim of its past conditionings. This form of 
mechanistic and materialistic determinism has led in medical jurisprudence to the 
legal defense that the guilty person is unable to control his actions which are the 
unavoidable and involuntary consequence of his past conditionings. Behavioristic 
theory also leads to the conclusion that training in self-control and the development 
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of volition are superfluous and impossible of achievement. In its insistence on the 
importance of environment as the principal determiner of behavior, it has led to the 
unrealistic attempt to solve all human problems by environmental manipulations 
on the premise that conditioning is the only way to influence human behavior. At 
the most ridiculous extreme of unrealistic application of objectivism, we have the 
rat psychologist who seeks to interpret all of human behavior on the basis of over- 
simplifications derived from studies confined exclusively to subhuman species. 

The child guidance movement has also been associated with a good deal of un- 
realistic and at times Pollyannaish theorizing on the part of childless workers whose 
orientations show slavish devotion to prevailing psychological fads. For example, 
the at times exaggerated and very unrealistic insistence on complete acceptance and 
permissiveness in child handling which establishes no limits no matter how uncon- 
trolled and asocial the child’s behavior becomes. Thus the admonition that the par- 
ent or teacher should “love” a child even though it may be completely unlovable. 
Psychologists have learned the futility of telling clients to “stop worrying” or ‘“‘don’t 
be afraid because there’s nothing to be afraid of” but some of them seem not to real- 
ize that “love” cannot be legislated either. Indeed, many of us have concepts of 
childhood far removed from a clear understanding of just what children are psycho- 
biologically. At the present time, child psychologists place great emphasis on the 
socialization of the child to the point where great anxiety may be stimulated in the 
introverted shy child who may be constitutionally unable to adjust under the social 
pressures to which he is exposed. The group “‘ego-ideal” of the current American 
scene appears to be that of a super-vivacious extravert who masters all interpersonal 
problems with a glamorous smile charged with good feeling and who freely expresses 
emotions whether they really exist or not. Far from learning to be himself, the aver- 
age child simply becomes adept in assuming such affectations as may assume collo- 
quial popularity and-which do not contain much genuine feeling. One has only to 
visualize the straddle-legged, arms akimbo, head jauntily twisted, ivory toothed 
smile of the average mode! pictured in any American magazine to know what we are 
talking about. 

And finally, the traipses into pseudoscientifie phantasy which have so often char- 
acterized the psychoanalytic movement. A few decades ago, cartoons characterized 
the “‘bug”’ scholar who explored darkest Africa armed with a topheavy collection of 
bottles and nets. Today it is the goateed psychiatrist or analyst, armed with hyp- 
notic stare and couch, who dredges into the depths of the unconscious to bring forth 
loads of repressed complexes striving to wriggle back into the protection of the un- 
conscious. One can get some idea of the relative scientific respectability of the fields 
of psychoanalysis, psychiatry and scientific psychology by comparing the objectivity 
of method used in reporting papers in these respective fields. To date, psychoanaly- 
tic writings are almost completely innocent of scientific methodology, psychiatric 
journals contain materials which are perhaps 50°; scientifically oriented while almost 
all psychological journals accept only papers which are experimentally or statisti- 
cally oriented. 

What will be the answer to our current state of confusion? In our opinion, one 
of the greatest needs of the present psychological scene is the rapid collection of 
normative data concerning all aspects of human behavior. The Kinsey reports con- 
stitute a pilot study showing what may be accomplished in the rapid accumulation 
of data to either prove or disprove mooted theoretical points. It is hoped that other 
scientists will dedicate their lives to the collection of data in specific fields. We men- 
tion whole lives because it appears that full justice will only be done in the various 
potential areas of research when full time attention is dedicated to them. The public 
opinion polls might contribute a valuable service by systematically sampling the 
whole population on matters of general psychological import. It is only through the 
systematic collection of large scale data that many of our most pressing problems 
will be solved. 
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Professor of Psychology at Adelphi College. For the purposes of illustrating the 
dynamics of psychological testing, he selected all the cases referred for appraisal in a 
single month at Hillside Hospital and here reproduces all the data together with 
interpretive diagnostic formulations. Dr. Gurvitz has a genuinely eclectic viewpoint 
and this book is more scientifically objective than other recent books in the field. 
His interpretations of personality dynamics are based solidly on test data rather than 
upon unsupported speculations. This book will make a valuable reference book for 
courses in diagnostic testing. 


Srrecker, Epwarp A., EBaAuGH, FRANKLIN G., and Ewaxt, Jack R. Practical 
Clinical Psychiatry. Philadelphia: Blakiston, 1951. Pp. 506. Seventh edition. 


{uESCH, JURGEN. Chronic Disease and Psychological Invalidism. Berkeley: Uni- 
versity of California Press, 1951. Pp. 191. $3.50. 

In this monograph, a study is made of personality factors contributing to 
psychological invalidism. The investigation of delayed recovery included a group of 
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of the difficulty results from its rather unusual plan of organization. The book really 
is two volumes in one, the two halves having quite different objectives. Volume 
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ercises drawing eclectically from the psychology of perception, general semantics, 
Yoga and psychoanalytic mental mechanisms. Even though this first section is ap- 
parently intended to be read and applied by laymen to themselves, the presentation 
is written on quite a difficult level due to the authors’ effort to keep scientifically 
oriented by introducing their concepts in a framework of Gestalt psychology. It is 
questionable whether laymen will comprehend the complexities of theory which are 
introduced throughout the text except under the tutelage of a competent psycho- 
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popularize their methods before the system has been exposed to an intensive scienti- 
fic evaluation by the profession. 

The basic method underlying the system of tasks is to improve the efficiency 
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ing-its-environment by means of finer perceptual discriminations, symbolic repre- 
sentations and self-signalling cues. In some respects, modern objective psychology 
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ment. Therapy ts directed toward achieving “strong” Gestalts, i.e. to secure opti- 
mum figure-ground relationships. An attempt is made to interpret the data of 
classical psychology and psychoanalysis in terms of Gestalt field theory. Since most 
of the presentation is purely speculative and unsupported by research data, the 
reader will keep in mind that many of the conclusions represent stimulating hy- 
potheses rather than proven facts. 
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